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Preface 


This is an intermediate-level textbook on electricity and magnetism. It is intended 
to be used for a two- or one-semester course for students of physics, engineering, 
mathematics, and other sciences, who have already had a one-year introductory 
physics course with calculus. 

The book is flexible enough to be used in several ways: (1) The traditional two- 
semester course would cover electrostatics and magnetostatics in the first semester 
using Chapters 1-8; and then magnetic materials and time-dependent fields in the 
second semester using Chapters 9-15. (2) An instructor teaching a one-semester 
course could cover all the basic principles of electromagnetism by using Chapters 
1-3 and 6-11; there might also be time for a few examples from Chapters 4 and 5. 
(3) An interesting alternative approach in a two-semester course would be to go 
over the basic principles of Chapters 1-3 and 6-11 in the first semester, and then 
applications and advanced topics in the second semester based on Chapters 4,5, 
and 12-15. 

The total material in the book is more than could be realistically covered by any 
instructor, even in two semesters. Instructors are encouraged to pick and choose 
based on their own judgment of what is important. Electricity and magnetism is a 
wonderfully interesting subject, but to students at the intermediate level its phys¬ 
ical concepts are non-intuitive, and the associated mathematical techniques are 
new and challenging. Therefore it’s important in teaching this subject to avoid the 
kind of heroic pace which will tire out all but the strongest students and instruc¬ 
tors. The general principle that in teaching it’s better to uncover a little than to 
cover a lot, applies to this subject of course. 

The order of presentation of subjects is the traditional one: electrostatics first, 
then magnetism, electrodynamics and Maxwell’s equations, relativity, and radia¬ 
tion. Chapter 2 is an introductory treatment of vector calculus, which should help 
students acquire the necessary mathematical armamentarium. Our experience in 
teaching this subject is that at the outset of the course most students do not know 
vector calculus well enough to study electromagnetic field theory, so it’s impor¬ 
tant to help them gain the necessary mastery. Chapter 2 is sophisticated in places, 
and it is not necessary to comprehend all of it before starting on Chapter 3; the 
student can return to Chapter 2 when additional mathematical skill is needed. Stu¬ 
dents might also read a specialized book on vector calculus (e.g., one of the two 
references at the end of Chapter 2) while studying Chapter 2. 

We have given an extensive treatment of electrostatics, in Chapters 3-6. The 
topics treated later in the book are more interesting than electrostatics to many stu- 
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dents and instructors, so there’s a tendency to hurry into them. But our experience 
is that time invested studying electrostatics pays dividends later on, because stu¬ 
dents acquire in electrostatics mathematical skills and confidence that are needed 
for other areas of electromagnetism. 

Many good textbooks have been written on electromagnetism, and thousands 
of students have learned the subject from them. The two authors have taught this 
subject to hundreds of students over several decades, using some of those earlier 
books. It was from those many interactions with students, as well as with our 
colleagues and teachers, all remembered with pleasure, that we were led to write 
this book. 

What is special about this book? For the most part it is a traditional, even con¬ 
ventional, exposition of electromagnetism, but we have also done three things 
we believe are important, and not stressed quite enough in other textbooks. First, 
we have tried to show how the mathematical principles that students are study¬ 
ing are used in modem technology—i.e., in real applications that students en¬ 
counter in science and everyday life—applications such as cellular phones, optical 
fibers, magnetic resonance imaging, and charged particle accelerators and detec¬ 
tors. How is Faraday’s Law related to the electricity in a wall socket? How can we 
calculate the interaction between radio waves and the ionosphere? Although it is 
necessary to study highly idealized, academic examples in field theory—e.g., the 
magnetic field of an infinitely long wire carrying a constant current, or the electric 
field in a spherical capacitor—students should also learn that the theory describes 
real physical phenomena and devices. The ideal cases are not the whole story. 

Second, we have included in the text many worked-out mathematical examples 
in each area, including some examples that go beyond the elementary, exactly 
solvable, ones, and other examples that require multi-step analysis such as the 
use of the superposition principle. This book is based on a two-semester course 
that we have taught at Michigan State University. The course is taken by senior 
undergraduate students, and some first-year graduate students who are not yet en¬ 
rolled in the graduate-level course. Our experience is that for students to master 
the intermediate level they must study more than just the simplest cases. Even 
to ’’uncover a little” requires that the instructor show a variety of examples. Each 
chapter starts at an elementary level, with topics the student is likely to know from 
an introductory physics course. The discussion then leads into the junior/senior- 
level material which is the heart of the book. At the end of several chapters we’ve 
introduced an interesting, more advanced, subject; we hope this will inspire stu¬ 
dents to future further study of the subject by indicating what lies beyond the 
horizon. 

Third, we have included a number of computer calculations, both in the text 
and in the end of chapter exercises. Computer software that integrates analysis, 
numerical calculation, and graphics, e.g., Mathematica, Maple, Mathcad, Matlab, 
Excel, etc., can be used for these calculations. Students who are comfortable with 
Fortran or other computer languages can do the exercises by writing their own 
programs. Much of current physics, both experimental and theoretical, is done 
with computers, so today’s students need this experience. On the other hand, the 
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computer cannot replace the understanding of theoretical principles. We regard 
the computer exercises as an important, but not dominant, part of the book. 

An essential part of this subject is learning how to do problems. There is a ten¬ 
dency (maybe it’s even a tradition) for some textbooks on this subject—Smythe’s 
book being the most dramatic example—to give many very difficult problems, on 
which the student spends uncounted and often frustrating hours. Many practicing 
physicists, the authors included, have been brought up in this draconian school. 
We believe however that the principle of ’’all things in moderation” should be ap¬ 
plied to E&M problems. At the end of each chapter we have given a number of 
exercises of various degrees of difficulty, but mostly of only moderate difficulty, 
which are intended to help students understand the subject. Hints and answers are 
given in many of the exercises. 

In writing Electromagnetism we had in mind the learning needs of present-day 
students, who are in some ways ready for a deeper understanding of the subject 
than we, their instructors, were at their age. They have had sophisticated courses 
in mathematics, even if they are still learning to apply this mathematics; we there¬ 
fore use advanced mathematics freely, but also give generous explanations, so that 
students can exercise these valuable, newly acquired, skills. Mastering the subject 
at the intermediate level is not easy, and we encourage students to discuss what 
they are learning with other students and with their instructors at every opportu¬ 
nity. 1 Learning and doing physics has a social component. One learns much more 
from such discussions than by reading a textbook alone. 

We believe it is especially important to meet the needs of those students who 
will go on to further study of electrodynamics, for example in graduate school in 
physics or electrical engineering, or who will use these principles in industry or 
engineering. Among these are the men and women who will write the next gen¬ 
eration of books on this and related subjects, invent and develop new applications 
and (who knows?) discover new principles. To this end we try to extend the stu¬ 
dents’ knowledge to a high enough level that they will be adequately prepared for 
working in J. D. Jackson’s Classical Electrodynamics, or similar advanced books. 

We would like to thank our students for listening to so many of our lectures on 
electromagnetism. We also owe a great debt to our own teachers of this subject, 
among them, R. P. Feynman, M. Firebaugh, R. C. Garth, D. L. Huber, F. E. Low, 
W. Mais, and W. R. Smythe. 


'“Many will range far and wide and knowledge will increase.” Daniel 12.4. 
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This Section together with Chapter 1 might appropriately be called “Loomings”, 
in the sense that “Loomings” is the title of Chapter 1 of Moby Dick 2 , because 
the purpose here, as there, is to give background and indicate an approach to a 
large subject. In any case, natural looming, which is observed in the atmosphere 
over sea or land, is an optical, i.e., electromagnetic, phenomenon. It is caused by 
variation of the index of refraction of air with temperature and height. Looming 
makes objects like ships and lighthouses look larger and more threatening than 
they really are. There is something like that effect when one first approaches the 
study of electromagnetism, and is confronted by a very large subject. But, as in 
optical looming, the size is not really so large. After all, there are only 4 Maxwell 
equations. 

Although learning electromagnetism is a great intellectual challenge - mas¬ 
tering any subject is difficult of course - there’s nothing in it that should deter 
serious students. On the contrary, in our study of the subject we will encounter 
many interesting ideas, physical phenomena, and mathematical techniques; and 
many of these things have great beauty! It’s a challenge less dangerous than the 
voyage of the Pequod. A good thing about the electromagnetism voyage is that it 
won’t end. 

In this Section the authors offer some suggestions to smooth the way of learn¬ 
ing. 

How to study this book. In learning a new subject, especially one with a lot of 
mathematical content, it’s a good idea to take it in small increments. In this book 
we’ve tried to limit the ideas/page, i.e., the idea density, to a value for which 3-5 
pages per learning session is about right. It is good practice to read the material 
before going to class. This first reading can be casual, without paper and pencil. 
Then the instructor’s presentation will be more understandable. But then it is nec¬ 
essary to reread the material, this time slowly and carefully with paper and pencil 
at hand. Try to fill in the intermediate steps in the calculations. When you get 
mired down, which is inevitable in theoretical physics, don’t spend an inordinate 
amount of time struggling with a particular step. Pick up the argument wherever 
you can and go on. 

Worked-out examples. The Examples in the text illustrate application of the 
principles to classic problems. They range from elementary to sophisticated. The 

2 Herman Melville, Moby Dick, 1851. 
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latter introduce new ideas. We suggest that you read each example line by line, 
so that you are able to repeat the calculation on your own and apply the ideas to 
other problems. 

Formal language. Informal language plays an essential role in learning and ex¬ 
changing information. We all use it most of the time. But more formal language, 
which is used in this and similar books for proofs, derivations, and statements of 
problems, is also necessary. It is important not to be intimidated by the formalism, 
whose purpose is to avoid confusion or ambiguity. You’ll get used to it, and after 
reading a few chapters you’ll feel comfortable with it. 

Exercises. In the end, most of what you leant will have come from doing ex¬ 
ercises yourself. 3 Because so much time and effort must go into this process, the 
experience should be as pleasant as possible. Therefore, exercises are provided at 
the end of each chapter. Ideally, you should try to understand and solve all the 
end-of-chapter exercises. 

A common student difficulty in using textbooks is “to understand the material” 
but not be able to do the problems. There’s a gap between what is learned from the 
text and what is needed to do the exercises. There is no easy fix for this difficulty, 
but we have tried some things to help bridge the gap: Some exercises are footnoted 
in the text where they are related to the text. Some exercises are grouped together 
by the appropriate section of the chapter. Of course the best exercises require 
material from many sections, and a wider perspective. Those exercises are more 
like the problems that arise in research in science and engineering. 

The end-of-chapter exercises include a range of difficulty, but we have tried to 
omit the very difficult kind that only a few students can solve, and then only after 
a Herculean effort. 4 Although it is crucial to work on the exercises yourself, it is 
also useful to discuss them with classmates, instructors, or interested bystanders. 
There is a social aspect of science, and you will leam a lot by interactions with 
other scientists. Please eschew the “method of multiple books” - looking for the 
solution of an exercise in another book. That takes more time than just figuring it 
out yourself. 

If you get stuck on some problems, console yourself with the thought that this 
happens to everyone. The knowledge you are seeking has taken hundreds of years 
to accrue, developed by thousands of people who made the same mistakes and 
false starts as you. So, if you feel hopelessly stuck, then close the book, contem¬ 
plate the problem at leisure, talk it over with some other people, and try again 
tomorrow. 5 

3 Most physicists will say: “Everything I know I taught myself!” What they mean is that they worked 
out a lot of problems, by the sweat of their own brows, and that’s how they learned the subject. 

4 If you want to try some really difficult problems, they can be found in more advanced books like 
Smythe or Jackson. 

5 You can take encouragement from the Midrash Rabba (Fifth century) which says: 'There isn’t a 
blade of grass in the world below which doesn’t have a star and an angel above that strike it and tell 
it, ‘Grow! Grow!’ ” 
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XIX 


Introductory textbooks. This book is intended for students who have already 
taken an introductory college-level course on physics. All introductory textbooks 
cover electricity and magnetism, in quite a lot of detail, including the mathemati¬ 
cal theory. The textbook used in your introductory course should be used as a ref¬ 
erence book when you encounter a difficult topic in this intermediate-level book. 
(In the same way, when you study electromagnetism in a graduate course, you will 
refer back to this book when you encounter a difficult topic in the advanced-level 
graduate text.) Reading over the material at a slightly lower level of sophistication 
can be very helpful in understanding the new ideas. 

A list of good introductory textbooks is given at the end of Chapter 1. 




CHAPTER 



History and Perspective 


“The theory I propose may therefore be called a theory of the 
Electromagnetic Field, because it has to do with the space in the 
neighborhood of the electric or magnetic bodies 

James Clerk Maxwell, Transactions of the Royal Society, 1864. 


The electromagnetic interaction is one of the fundamental interactions of the 
physical world. It is the basic interaction in atoms and molecules. In nature it 
manifests itself in sunshine, lightning, rainbows, and many other phenomena, so 
that its study is as old as the attempts to understand such common observations. In 
technology the study of electromagnetism is as new as techniques for communi¬ 
cating with NASA’s planetary probes, electromagnetic medical imaging devices, 
computer electronics, and other modem marvels. The subject will no doubt be re¬ 
newed by the understanding and future applications developed by serious students 
of science and engineering, like you, for whom this book is intended. 

Electromagnetism is important in all areas of physics. Section 1.2 of this chap¬ 
ter describes the place of electromagnetism in the “Standard Model” of the fun¬ 
damental interactions. But we begin now with a brief history of our subject. 


1.1 ■ BRIEF HISTORY OF THE SCIENCE OF ELECTROMAGNETISM 

Electric and magnetic phenomena have been known for millenia. The earliest 
examples were forces produced by static electricity and by ferromagnetism. Of 
course there are other phenomena that we recognize today to be of electromag¬ 
netic origin, which have been observed since the beginning of time. For example, 
lightning is an electric discharge. Also, light consists of electromagnetic waves, 
or, in the quantum theory, photons. But before the scientific revolution it was not 
recognized that these varied phenomena have a common origin. During the 19th 
century a unified theory was constructed, describing electricity, magnetism, and 
optics, based on the electromagnetic field. 

The word “electric” was coined by William Gilbert 1 as a name for the phe¬ 
nomena of static electricity. The origin of the word is from the Greek “electron” 
(r]XeKzpov) which means amber. Amber is a fossilized plant resin, an electrical 

1 William Gilbert was bom in 1544 in Colchester, England, and died in 1603. 
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insulator which, when rubbed with animal fur, acquires the ability to attract small 
bits of matter such as straw or paper. This primitive example of static electricity 
was known to the philosophers of ancient Greece. 

The word “magnetic” was also coined by Gilbert as a name for the phenomena 
of ferromagnetism. The origin of this word is the Greek place-name “Magnesia,” 
which was a region of ancient Greece where magnetic iron ore (magnetite) oc¬ 
curs in the Earth. The forces between natural magnets were also known to the 
philosophers of ancient Greece. 

These earliest observed electric and magnetic phenomena must have seemed 
interesting (in the way of a mystery) to philosophers, but they had no practical 
importance. In contrast, the scientific study of electricity and magnetism during 
the past 400 years has dispelled much of the mystery, and created technological 
power that the ancients could not have imagined. 

The study of electricity and magnetism was an early part of the scientific revo¬ 
lution. In 1600 Gilbert published an important book, De Magnete, one of the first 
scientific works on electricity and magnetism. It was Gilbert who demonstrated 
that a compass needle points north because the Earth itself is a large magnet ex¬ 
erting a magnetic force on the magnetized needle. Regarding electricity, Gilbert 
discovered that many substances besides amber exhibit static electricity from fric¬ 
tion, including glass, sulphur, sealing wax, and gemstones. 


Static Electricity and Current Electricity 

The science of electricity progressed significantly during the 18th century, be¬ 
cause of two technical advances. Electrostatic devices were invented that create 
and store large amounts of charge, i.e., static electricity. Also, electrochemical 
cells were developed, the forerunners of today’s batteries, to create controlled 
electric currents. 

DuFay 2 discovered that there are in fact two kinds of static electricity. He 
named these forms resinous, produced by rubbing such materials as amber; and 
vitreous, produced by rubbing glass. Today we call these negative and positive 
charge, respectively. The theoretical models that were developed at that time to 
explain static electricity were based on the assumption that electricity is some kind 
of “fluid” distinct from matter. Two models were in competition, the one-fluid 
model and the two-fluid model. Franklin 3 invented the one-fluid model, which 
supposed that neutral matter possesses a certain amount of the electric fluid, and 
positive charge is an excess of the fluid, negative charge a deficiency. On the other 
side, the two-fluid model assumed that there are two different fluids—one posi¬ 
tive, the other negative—and neutral matter contains equal amounts of the two. 
The modem understanding of electricity emerged much later, with the discovery 
of the electron by Thomson 4 in 1897: Charge is a property of matter, residing in 
the elementary particles—electrons and protons—that compose atoms. 

2 Charles Francois DuFay, b. 1698. d. 1739 

■^Benjamin Franklin, b. 1706, d. 1790 

4 J. J. Thomson, b. 1856, d. 1940 
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The definitive experiment on the force between charges was done by Coulomb 5 
in 1785. He devised a delicate torsion balance for measuring small forces accu¬ 
rately. Using this balance, he determined that the force between small charged 
objects is inversely proportional to the distance squared, and proportional to the 
charges, F oc qxqifr 1 . 

Electric current—the flow of electric charge—occurs in discharges of static 
electricity. Lightning is an example in nature. But accurate experiments with cur¬ 
rent were not possible until the invention of the battery, which provides a source of 
continuous current. A battery is a chemical reactor that produces a constant elec¬ 
tromotive force between two electrodes. The battery was developed by Volta, 6 
following the discovery by Galvani 7 that if different metals are in contact with an 
aqueous salt solution then there is an electric current. Galvani’s original discov¬ 
ery was in animal tissue (frogs’ legs to be precise) and he believed that galvanic 
effects were at least partly biological. But Volta showed that current could be 
produced with purely physical samples. By constructing a “pile” of metal plates, 
alternating between different metals, immersed in acid or salt solution, large cur¬ 
rents can be produced. This discovery made available a new source of electricity, 
in the form of continuous current. The first suggestion that the forces holding 
atoms and molecules together are electric forces came from Carlisle and Nichol¬ 
son 8 in 1800 who passed a current through salt water and obtained hydrogen and 
oxygen gases. Current electricity was a necessary prerequisite for the advances in 
the understanding of electricity and magnetism that followed in the 19th century. 


Unification of Electricity and Magnetism 

In 1820, during a public lecture on electric and magnetic ^.,^cts, Oersted 9 discov¬ 
ered that electric current in a metal wire produces deflection of a compass needle. 
This accidental discovery was the first indication that electricity and magnetism 
are related. In modem terms, Oersted had found that an electric current produces 
a magnetic field. Within a few days of the discovery, Biot and Savart 10 had repro¬ 
duced the effect in their laboratory, and found that the strength of the magnetic 
field varies inversely with the distance from the wire. As to direction, by tracing 
the magnetic force in the region of the wire they showed that the magnetic field 
curls around the current. 

Ampere" studied in great detail how electric currents produce magnetic ef¬ 
fects. His ultimate result was an equation for the action of one current element 


5 Charles Augustin de Coulomb, b. 1736, d. 1806 
6 Alessandro Volta, b. 1745, d. 1827 

7 Luigi Galvani, b. 1737, d. 1798 

8 Anthony Carlisle (1768-1840) and William Nicholson (1753-1815) built the first battery in Britain, 
made from British coins and wet cardboard. 

9 Hans Christian Oersted, b. 1777, d. 1851 

10 Jean-Baptiste Biot (1744-1862) and Felix Savart (1791-1841) 

11 Andre Marie Ampere, b. 1775, d. 1836 
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I\dl\ on another hd.f, 2 \ the force on the first element has the form 
JFi = Cl\di i x (hd £-2 x r') /r 2 

where C is a constant that depends on the system of units, ("r is the unit vector 
in the direction from dt 2 to d£\ and r is the distance between them.) The student 
may recall that the cross product of two vectors points in a direction perpendic¬ 
ular to both. Ampere’s result, which involves two cross products, anticipates that 
magnetic fields and forces may be a little tricky to figure out. 

If electric current produces magnetism, then a natural question is whether mag¬ 
netism can produce an electric current. Faraday 12 began experiments on electric¬ 
ity and magnetism in 1821, and he discovered electromagnetic induction in 1831. 
Magnetism does produce an electromotive force when the magnetic field is chang¬ 
ing in time. The same effect was discovered at about the same time by Henry 13 in 
America. 

After these observations relating electricity and magnetism, the challenge was 
to develop a theory that could account for all the effects in a unified way. The 
theory that finally succeeded was field theory. 


The Electromagnetic Field 

Faraday, from his years of experimentation with electric and magnetic phenom¬ 
ena, had developed a theoretical picture in which the phenomena were created by 
something outside the charges and currents. Faraday was self-educated, and more 
interested in new experimental results than advanced mathematical theories. His 
thinking was not limited by the existing theories of action at a distance that had 
been used to describe the forces between charges and between currents since the 
time of Coulomb and Ampere, in analogy with Newton’s theory of gravity. Fara¬ 
day called his alternative idea “the lines of force,” by which he meant a physical 
entity that extends throughout space, outside the charge or current, and affects 
other charges or currents. This field concept is the opposite of action at a dis¬ 
tance: The force on a charge or current element is due to the field at the location 
of the charge or current element. 

Faraday’s concept of the lines of force was put into precise mathematical form 
by Maxwell. 14 Maxwell published his equations in 1864, and we use the same 
equations to describe electromagnetism today. Maxwell coined the phrase “the 
electromagnetic field” as the name for his theory, based on the hypothesis that 


12 Michael Faraday (1791-1867) is often regarded as the greatest experimentalist in the history of 
physics. From humble origins Faraday rose to fame in science, and his biography is truly inspirational. 
Electromagnetic induction, discovered by Faraday, is the basis for the electric power we use every day. 
13 Joseph Henry (1797-1878) was a teacher in Albany, and his teaching duties were so heavy that he 
could only do research in August. He later became the first director of the Smithsonian Institution in 
Washington. 

14 James Clerk Maxwell was bom in Edinburgh in 1831 and died in Cambridge in 1879. His field 
equations, published in 1864, are the subject of many physics textbooks, including this one. 
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the cause of electromagnetic phenomena is found in the space surrounding the 
charges. He was guided by fluid mechanics and elasticity—each of which is a 
mechanical system in a medium—and regarded the electromagnetic field as a 
measure of stresses and strains in a medium called the “aether.” Today, however, 
we ascribe physical reality to the field itself, and deny the existence of an aether. 

To construct a mathematically consistent set of field equations, Maxwell had 
to postulate the existence of a new electromagnetic effect, not known from ex¬ 
periments at that time: the displacement current. The resulting equations have 
wave solutions. When Maxwell calculated the wave speed, from measured elec¬ 
tric and magnetic force parameters, he found that the wave speed has the same 
value as the measured speed of light. Thus this mathematical theory revealed that 
light is electromagnetic. Maxwell’s equations form a unified theory of electricity, 
magnetism, and optics. Electric and magnetic effects are described by Maxwell’s 
equations. 


1.2 ■ ELECTROMAGNETISM IN THE STANDARD MODEL 

The subject of this book is classical electromagnetism. Electromagnetism is one 
of the fundamental interactions of nature. But it is important to realize that it 
is a part of a larger picture. The purpose of this section is to describe briefly 
how electromagnetism fits into our current knowledge of physical forces. The 
current theory of the fundamental interactions is called the Standard Model. In 
this model, all forces and particle reactions derive from four basic interactions: 
electromagnetic and weak, chromodynamic, and gravity. 

The standard model for the electroweak interactions is a field theory. The two 
types of weak interaction (charged and neutral) together with electromagnetism 
are all described by a unified field. The complete theory is consistent with the 
principles of relativity, and has interesting symmetries among the interactions. 
Electromagnetism is part of this unified theory. 

Chromodynamics, or quantum chromodynamics (QCD), is the theory of the in¬ 
teractions between quarks and gluons—the constituents of protons and neutrons. 
The strong forces exerted by protons and neutrons in an atomic nucleus are a 
consequence of QCD interactions between the constituent quarks. QCD, too, is a 
field theory. It is a remarkable discovery, established during the final quarter of 
the 20th century, that the basic interactions of elementary particles (except grav¬ 
ity) are described by field theories analogous to electromagnetism. 15 

The modem theory of gravity is Einstein’s general relativity, in which space- 
time is non-Euclidean. An important unsolved problem in theoretical physics is 
to develop a fully unified theory, combining the electroweak and chromodynamic 
field theories and gravity. 

15 In the modem jargon the electroweak and chromodynamic theories are gauge theories , i.e., pos¬ 
sessing a special symmetry. The electromagnetic field theory also possesses a gauge symmetry, which 
we’ll learn about in Chapter 11. 



6 


Chapter 1 History and Perspective 


Elementary Charges, Photons, and QED 

The basic phenomena of electromagnetism, from which all electromagnetic ef¬ 
fects derive, are interactions involving elementary charged particles and photons. 
These particles are so small (in energy) that they can only be described accurately 
by quantum theory. The quantum theory of electromagnetism— quantum electro¬ 
dynamics, or QED—has been tested by experiments more precisely than any other 
theory in physics. 

The elementary charges in matter are the electron, which has negative charge 
— e where e = 1.6 x 10 -19 C; and the quarks, which have fractional charge |e (u 
type) and —\e{d type). A proton has three valence quarks uud\ its charge is +e, 
i.e., the sum of the valence quark charges, because charge obeys the superposition 
principle. A neutron has valence quarks udd, so its charge is 0. 

What Is Electric Charge? Charge is a property of the elementary particles of 
matter. The property belongs to electrons and quarks. Therefore charge is quan¬ 
tized, in units of e/3. In ordinary matter quarks are confined in bound states with 
integer charge—protons and neutrons—by QCD forces. So, in ordinary matter 
charge is quantized in units of e. The value of the electric charge of an elementary 
particle is a coupling constant : It determines the strength of interaction between 
the particle and the electromagnetic field. 

The electromagnetic field is also quantized. The smallest (in energy) excitation 
of the field is a single photon. A photon is a packet of field with energy E = hv 
and momentum p = hv/c, where v is the frequency of oscillation of the field, and 
h = 6.63 x 10~ 34 Js is Planck’s constant. The energy and momentum are related 
by E = pc, which is the same as for a massless particle in special relativity. 
Light has both particle-like and wave-like behavior. A beam of light is a stream 
of photons; but light undergoes interference and diffraction like waves. It seems 
hard to comprehend this dual nature of light, but the particle and wave aspects 
of light are reconciled in the quantum theory QED. Classical electromagnetism 
describes the wave aspect of light, and we’ll study electromagnetic waves in detail 
in Chapters 11-15. 


The Classical Limit 

Classical electromagnetism is the science of electromagnetic phenomena in 
macroscopic systems. 

In a macroscopic system, the energy, charge, and field strength are so large that 
the quantization, i.e., discreteness, of these quantities is irrelevant. For example, 
consider a wire carrying current I = 1 A. The rate at which electrons pass a point 
in the wire is 1/e = 6 x 10 18 per second, which is so large that the discreteness 
of the current in electrons is unobservable. Or, consider a radio transmitter with 
frequency v = 1 MHz and power P = 1 kW. The rate of photon production is 
P/hv = 1.5 x 10 30 per second; again, the fact that the radio wave consists of 
discrete photons is unobservable because the number of photons is so large. Such 
macroscopic systems are described accurately by the classical theory. 
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Classical electromagnetism is the limit of QED for large systems, i.e., systems 
that involve many elementary charges or many photons. In QED the electromag¬ 
netic field is subject to quantum uncertainty, due to emission and absorption of 
single photons. The classical field is the mean field, in the sense of quantum the¬ 
ory, the average over the quantum probability distribution of field configurations. 
In a macroscopic system the uncertainty of the field is very small compared to the 
mean field, as illustrated by the previous two examples. 

The mean field of QED satisfies the classical field equations of Maxwell. 
Therefore Maxwell’s classical field theory is an accurate description of electro¬ 
magnetic effects for macroscopic charged systems and waves. 

Note that, of the fundamental interactions, only electromagnetism has both 
quantum and classical realms accessible to experiment. Gravity is so weak that 
on the quantum scale it is immeasurably small. The weak and chromodynamic 
interactions have very short range, so they do not have observable macroscopic 
effects that could be described by classical theories. Electromagnetism, uniquely, 
follows quantum theory on the atomic scale, and classical theory in the macro¬ 
scopic limit. 

This book is concerned with the classical electromagnetic field. 

Conservation of Charge 

That charge is quantized is not important in classical electromagnetism, because 
the quantum of charge e is so small. That charge is conserved is essential. 

One statement of conservation of charge is that the net charge of an isolated 
system is constant. A more far-reaching statement is that if the charge of any sam¬ 
ple of matter changes by A q, then the charge of matter in contact with the sample 
must change by —A q. This conservation law is ultimately based on experimen¬ 
tal observations. It is a basic postulate of classical electrodynamics and a conse¬ 
quence of QED. The particles that carry charge, namely electrons and quarks, are 
themselves conserved, so the sum of their charges is constant. Creation or anni¬ 
hilation of particle-antiparticle pairs can occur in relativistic collisions, but still 
the charge is conserved because particle and antiparticle have equal but opposite 
charges. All interactions of the Standard Model respect this conservation law . 16 


FURTHER READING 

On the history of the science of electromagnetism: 

Edmund T. Whittaker, A History of the Theories of Aether and Electricity (Harper, New 
York, 1960). 

Gerrit Verschuur, Hidden Attraction; The History and Mysteiy of Magnetism (Oxford Uni¬ 
versity Press, New York, 1993). 

Richard Feynman, QED; The Strange Theoiy of Light and Matter (Princeton University 
Press, New Jersey, 1985). 

16 In QED, the gauge symmetry implies conservation of charge through Noether’s theorem. 
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E. Kashy and S. McGrayne, “Electricity and Magnetism”, Encyclopaedia Britannica, 15th 
ed., Vol. IB, pp. 159-194 (1995). A very good overview of the subject. 

Introductory textbooks: 

R. P. Feynman, M. L. Sands, and R. B. Leighton, The Feynman Lectures on Physics, Vol¬ 
ume 2 (Addison Wesley, Reading, 1994). This book and its companion volumes are 
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P. M. Fishbane, S. Gasiorowicz, and S. T. Thornton, Physics for Scientists and Engineers, 
2nd ed. (Prentice Hall, New Jersey, 1996) 

D. Halliday, R. Resnick, and J. Walker, Fundamentals of Physics, 6th ed. (John Wiley, New 
York, 2001). 

E. M. Purcell, Electricity and Magnetism, Berkeley Physics Course Volume 2, 2nd ed. 
(McGraw-Hill, New York, 1985). 

R. Wolfson and J. M. Pasachoff, Physics, 3rd ed. (Addison-Wesley, Reading, 1999) 
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Vector Calculus 


“Mathematics is distinguished from all other sciences, 
except only ethics, in standing in no need of ethics.” 
From the essay “The Essence of Mathematics” 
by Charles Sanders Peirce (1839-1914). 


Electromagnetic phenomena are caused by electric and magnetic fields, E(x, t) 
and B(x, t). A field is a physical entity that fills a volume of space. E(x, t) and 
B(x, t) are vector functions of position x and time t. The most basic equations 
of electrodynamics, the Maxwell equations, are partial differential equations that 
state how the fields change in space and time. The body of mathematical knowl¬ 
edge that we need to describe the physics of fields is vector calculus. 

The purpose of this chapter is to review some important results in vector cal¬ 
culus. Anyone studying electromagnetism at the level of this book will probably 
already have some knowledge of this mathematics. Several good textbooks on 
vector calculus, such as Refs. [1] and [2], present theorems, proofs, interesting 
examples and exercises. This chapter summarizes the important results. 

Vector calculus is an important tool in studying electricity and magnetism. 
Maxwell and the other early developers of this subject did not have the advantage 
of using today’s vector calculus, and a study of their work shows how difficult and 
tedious the mathematics could be without it. Much of what is needed to use vector 
calculus is discussed in this chapter, and additional techniques will be introduced 
later when needed in specific applications. Our treatment includes the use of suffix 
notation (also called index notation), a modem approach, for some proofs and 
calculations. Suffix notation is an excellent method, but, frankly, it takes some 
getting used to at first. It is not necessary to master everything in the chapter the 
first time through it. Mastery will come with experience. It may be useful to revisit 
this material as needs arise. 

A Comment on Notation 

In calculations with vectors, it is essential to distinguish between different types of 
quantities: vectors, vector components, unit vectors, and scalars. All are different, 
and one must adopt notations that identify the type of each term in a mathematical 
expression. Table 2.1 shows the notations used in this book. A physicist must 
habitually use a notation that identifies which kind of quantity is represented by a 
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TABLE 2.1 Notations 


Quantity 

Notation 

Examples 

vector 

boldface 

x, E 

vector 

suffix notation 

X A’ E ‘ 

unit vector 

hat ~ 

i ,n 

vector component 

plainface and subscript 

Ex, Bg 

vector component 

subscript with vector in () 

(E)x, (B) e 

vector magnitude 

plainface or vector in | | 

|E| = E 

scalar 

plainface 

t,U 


symbol. In handwritten equations a vector must be identified as such, e.g., by an 

—^ 

arrow over the symbol; for example, E( x , t) for E(x, t). Every vector must have 
its arrow, and every unit vector its hat! 

Getting the notation right is truly important! In complicated mathematical 
problems, an error of notation is a common cause of a failed calculation. 


2.1 ■ VECTOR ALGEBRA 
2.1.1 ■ Definitions 

In elementary mathematics a vector is defined as a quantity with a magnitude and 
a direction. For our purposes, we need to be more precise. The most basic example 
of a vector is the position vector x of a point P in space, with respect to a chosen 
origin. 

In this book the symbol x is used to denote a position in space, or a position 
vector. Another position in the same system might be called x'. In other courses 
the reader may have used the symbols r and r' for this purpose. So, for example, 
a function of position in our notation is denoted F{\). We use x, rather than r, 
so that there is no chance of confusing the vector x with its magnitude r = |x|. 
Also, we prefer the symbol x for the position vector because r is often used to 
denote a relative vector , such as the vector from one charge to another. We use x 
for the position with respect to a fixed origin in space so that there is no chance of 
confusing x with a relative vector r. 1 

In terms of Cartesian coordinates, the position vector x of a point P is written as 

\=xi +yj + zk. (2.1) 

Here i , j , and k are the unit direction vectors of a set of Cartesian axes; and x, y, 
and z are distances from the origin along the axes, as illustrated in Fig. 2.1. The 
components x, y, z of x are lengths', i, j, k are dimensionless. 

1 The notation x for position is also used in the famous graduate textbook on electrodynamics by J. D. 
Jackson. 
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z 



FIGURE 2.1 Cartesian coordinates, x is the position vector of the point P; the compo¬ 
nents of x are the coordinates of P. We write x = xi + y j + zk. 


If the coordinate axes are rotated, but the point P is left fixed in space, then the 
direction vectors change, i, j, k i,j,k; and the coordinates of P on the 
new axes are different from those on the original axes, x, y, z —> x', /, z!. The 
vector x does not change, because P is fixed in space, so 

x=x'i +/}' +z'k'. (2.2) 

For example, for a rotation by angle 6 about k, as shown in Fig. 2.2, the transfor¬ 
mation of axis directions is 

i = i cost? + j sin 6 (2.3) 

j = — i sin0 + j cos 9 (2.4) 



FIGURE 2.2 Rotation of coordinate axes by angle 9 about axis k. The point P is fixed 
as the axes rotate. 
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k = k; (2.5) 

and the transformation of coordinates, which follows from (2.2), is 

x = i • x = x cos 6 + y sin 9 (2.6) 

y' = j • x = — x sin 6 + y cos 6 (2.7) 

Z = k • x = z. (2.8) 


(For example, to derive (2.6), x! is the dot product of (2.2) and i ; the right-hand 
side of (2.6) is obtained from the dot product of (2.1) and (2.3).) The coordinates 
transform linearly, so in general we may write the transformation as a matrix 
equation 

where R is a 3x3 matrix. 2 The matrix /^depends on the angle andaxis of rotation, 
but not on P. That is, the components of all position vectors transform by the same 
rotation matrix. 

The definition of a scalar is a quantity that does not change under rotation of 
the coordinate axes. For example, the charge density p(x) is a scalar function. The 
charge per unit volume at the point P is invariant under rotations of the coordinate 
axes. The definition of a vector is a quantity with three components that transform 
under rotation in the same way as the coordinates of a point. Let A be a vector 
with components A x , A y , and A z in the original coordinate system. Then the 
components of A with respect to the rotated axes must be 

^ A'y j = R ^ 4 j (2.10) 

where R is the same 3x3 matrix as in (2.9). Equation (2.10) is the defining 
property of a vector. 

The electric and magnetic fields 

E = E x i + E y j + £ z k 
B = B x i + By j + B z k 

are examples of vectors. Just having three components is not enough to make 
a quantity a vector. For example, i E x + j E y + k B z is not a vector. The three 
components must be geometrically related such that (2.10) holds for coordinate 
rotations. 

2 Exercise 5 at the end of this chapter. 
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2.1.2 ■ Addition and Multiplication of Vectors 

The product of a scalar a times a vector A is a vector aA whose components are 
aA x , aA y , and aA z . The sum of any two vectors A and B is a vector A + B, 
whose components are A x + B x , A y + B y , and A z + B z . It is straightforward to 
verify that aA and A + B satisfy the definition of a vector (2.10), as does any 
linear combination of vectors, aA + /SB. 

We also define multiplication of vectors. Indeed there are two multiplicative 
operations, dot product and cross product. The dot product, or scalar product, of 
two vectors A and B is a scalar A • B, defined by 

3 

A B = A X B X + A y B y +A z B z =J2 a >B,- (2.11) 

i=i 

(In the last expression the values of i = 1, 2, 3 correspond to x, y, z; i.e., A\ = 
A x , Ai = A y , and A 3 — A z . We shall use this notation on many occasions.) A • B 
is a scalar because it does not change under the transformation (2.9): 

( B x \ 

A' • B' = (RA) ■ (/?B) — ( A x A y A z )/? r /? B y = A B (2.12) 

V J 

The third equality (which proves the claim that A • B is invariant) is true because 
any rotation matrix is an orthogonal matrix ; that is, R T = R~ l . (R T is the trans¬ 
pose of R, and R~ l is the inverse of R.) We will discuss the geometric meaning 
of the dot product in Sec. 2.1.4, from which it will be obvious that A • B is a scalar. 

The cross product, or vector product, of two vectors A and B is a vector A x B. 
The ; th component of A x B is 


(A x B),- = AjBk — A k Bj (2.13) 

where ( ijk) is a cyclic permutation of (123). 3 (Again, each of the indices i, j, and 
k takes a value 1, 2, or 3, which correspond to x, y, or z respectively.) A handy 
way to calculate a cross product is to use the determinant formula 


A x B = 


i j k 

A x A v A 7 
B x By B z 


(2.14) 


Another formula, which may look difficult at first sight but is very powerful, is 


3 3 

(AxB)/=EE ^ijk A j Bf, 
7=1 <t=l 


(2.15) 


where is the Levi-Civita alternating tensor. The value of 6123 is 1, and is 
antisymmetric under exchange of any two indices, (e,-^ is sometimes called the 

3 The cyclic permutations of (123) are (123), (231), and (312). 



14 


Chapter 2 Vector Calculus 


completely antisymmetric tensor for three dimensions.) The antisymmetry im¬ 
plies that €ijk is 0 unless i, j, and k are all different. The nonzero components of 
€,jk are 


f 123 = e 231 = ^312 = 1. 

^213 = e 132 = ^321 = —1- (2.16) 

Later we will use the Levi-Civita tensor to derive useful vector identities involving 
cross products. 

It can be shown that A x B satisfies the definition (2.10) of a vector, 

(A xB)'=A'xB' = K(Ax B) (2.17) 

for any rotation R. The geometric meaning of A x B will be discussed in Sec. 
2.1.4. 

An important property of the cross product is that B x A = —A x B. This 
noncommutativity of the cross product is obvious from the definition (2.13), or 
from the antisymmetry of e,jk in (2.15). Also, by setting B = A we see that the 
cross product of a vector with itself is 0, A x A = 0. 

2.1.3 ■ Vector Product Identities 

Cross products and dot products often appear in multiplicity in calculations. There 
are important identities that relate such multiple products, of which we shall dis¬ 
cuss two. These identities are used often in vector field theory. 

One identity, true for all vectors, is 

A-(B x C) = (A x B) ■ C. (2.18) 

The most straightforward (but rather laborious!) way to verify this equation is 
to write out each side in components, and verify that the six terms are the same 
on both sides. There is a much easier way to prove the result, and other product 
identities, but it requires an understanding of suffix notation. We need to make a 
brief digression on suffix notation. 4 


On Suffix Notation 

In suffix notation, the vector A is denoted A,-. The notation is potentially danger¬ 
ous, because A, stands for two different things: the ;th component of the vector, 
and the vector itself. The context must be used to decide which is meant We use 
suffix notation rarely in this book, mainly just in proof s of vector identities. There 
is one branch of theoretical physics where suffix notation is always used for vec¬ 
tors, namely, the theory of relativity. This notation for vectors has become more 
widespread in physics in recent years, because of its prevalence in relativistic cal¬ 
culations. 

4 Suffix notation is explained in detail in Ref. [1], 
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The dot product of two vectors is 

A • B = A X B X + A y By + A Z B Z . (2.19) 

In suffix notation, this is written very succinctly as 

A B = AiBi, (2.20) 

making use of the Einstein summation convention: Any repeated suffix is under¬ 
stood to be summed from 1 to 3. So, A, /f, means £T =1 A, 6,. The sum is just 
understood. A repeated suffix is a “dummy index,” because it is summed over all 
of its values; it does not have a particular value. For this reason Aj Bj is exactly 
the same quantity as A, B ( ; both are equal to the sum in (2.19). The summation 
convention, invented by Einstein, is a wonderfully simplifying notation if we are 
careful and understand its usage. 

The cross product of two vectors is 

AxB = ( AyB z — A z By) i + ( A Z B X A X B Z ) j + ( A x By — AyB x ) k. (2.21) 

Suffix notation is more compact, 


(A x B),- = CijkAjBk 


( 2 . 22 ) 


where j and k are summed. 

Now, to prove (2.18) using suffix notation, note that the left-hand side of the 
equation is 


Ai (B x C),- = eijkAiBjCk (2.23) 

where again we have used the summation convention; on the left-hand side of 
(2.23) i is summed, and on the right-hand side i, j, and k are all summed. The 
right-hand side of (2.18) is 

(A x B) fc Ck — f-kij A { BjCk (2.24) 

where again any repeated indices are summed from 1 to 3. But ekij = f -ijk for any 
i, j, k by the complete antisymmetry of the Levi-Civita tensor; 

£ki j = —*ikj = A-Cijk- 

Thus the two sides of (2.18) are equal, and the identity is proven. 

Another important vector identity is the double cross product identity 

A x (B x C) = B (A • C) - C (A • B). (2.25) 

This identity is used over and over again in field theory. To prove it by writing 
out both sides of the equation in components would be quite tedious. Instead, we 
shall prove it using suffix notation. 
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The left-hand side is a vector; in suffix notation, it is [A x (B x C)] { . Now, 
convert the first cross using (2.15), and then the second by the same equation, 

[A X (B X C)]; = jkAj (B X C)ft = €ijk€klmAjB(C m . (2.26) 

Again the repeated indices (jkim) are summed from 1 to 3; the second expression 
is a double sum, and the third a quadruple sum. (Note that in the second applica¬ 
tion of (2.15) it was necessary to introduce new dummy indices £ and m because 
j and k were already in use.) An identity of the Levi-Civita tensor is 


Zijk.Zk.tm = SuSjm ~ 


(2.27) 


where <5, y is the Kronecker delta tensor 


8 


‘j 


I if i = j 
0 if i 7 ^ j. 


(2.28) 


The proof of (2.27) is an exercise. 5 A suffix sum involving the Kronecker delta 
may be eliminated by putting the summed suffix equal to the other suffix of the 5; 
for example, &ijuj = Uj. Using this trick for the sums over l and m in (2.26) we 
find that the right-hand side of (2.26) is 


AjSuBeSjmCm - Aj8jt_Bi8i m C m = AjBiCj - AjBjCi (2.29) 

(sum on j implied) which is, in suffix notation, the vector on the right-hand side 
of (2.25). Hence the double cross product identity is proven. 

Having proven that (2.18) and (2.25) are true for any vectors, we may use these 
identities freely in calculations. We don’t need to rederive the result each time we 
use it. Students often find it difficult to know when to use a vector identity, or to 
remember the identities. Practicing physicists may not remember all the identities 
either, but when an identity is needed it can be derived “on the fly” (after plenty 
of practice!) using the Levi-Civita tensor. 


2.1.4 ■ Geometric Meanings 

Our discussion of vectors has been purely algebraic so far. Vectors also have a 
geometric aspect. The geometric meanings of the dot and cross products are con¬ 
tained in the equations 


A B = |A||B|cos0, (2.30) 

A x B = |A||B|sin0n, (2.31) 

where 0 is the angle between A and B. Our notation is that | A| denotes the mag¬ 
nitude of A, 


5 See Exercise 3. 


|A| = yjAj + A2 + A2 = (. A k B k ) 1 ' 2 . 


(2.32) 
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FIGURE 2.3 Direction of the cross product. A x B is perpendicular to both A and B, 
i.e., to the plane spanned by A and B, in the sense given by the right-hand rule. 


For example, for the position vector x the magnitude |x| is the distance from the 
origin to P. 

A B is the projection of A on B times the magnitude of B, or the projection of B 
on A times the magnitude of A. If A and B are parallel, then AB is just the product 
of their magnitudes. If A and B are perpendicular, then A • B is 0. Because lengths 
of vectors and angles between vectors are invariant under coordinate rotations, it 
is obvious that A • B is a scalar. 

In (2.31) n is the unit vector perpendicular to the plane spanned by A and B, in 
the sense defi ned by the right-hand rule, as shown in Fig. 2.3: If the fingers of your 
right hand curl from A to B then the thumb points in the direction of A x B. The 
cross product is perpendicular to both A and B. The factor n in (2.31) is crucial; 
without that factor (2.31) would be nonsensical, because the left-hand side is a 
vector, but the right-hand side (without the n) would be a scalar. A vector cannot 
equal a scalar! When doing calculations with cross products, it is important not to 
lose the vector. Again, the geometric statement (2.31) makes it obvious that A x B 
transforms as a vector under coordinate rotations. 


EXAMPLE 1 Let A be the vector i + j in the xy plane, and B the vector j + k 
in the yz plane, shown in Fig. 2.4. Find the magnitude and direction of A x B. 


z 



FIGURE 2.4 Example 1. C is A x B. The vectors shown are A = i + j,B = j + k, 
and C = i — j + k. The direction of C is perpendicular to both A and B. 
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We could calculate A x B from the determinant formula, but for this example 
it is simple enough to expand the product, as 

A x B = (i + j) x (j + k) 

= i xj + i xk + j xj+j xk 

= k- j + I. (2.33) 

The magnitude of i — j + k is V3- This magnitude agrees with (2.31); the mag¬ 
nitudes of A and B are both >/2, and the angle 0 between A and B is 60 degrees 
(because cost? = AB /(AB) = I/2) so AB sin0 = \/3. The direction of i — j +k 
is indicated in Fig. 2.4 by C, perpendicular to both i + j and j + k. 


2.2 ■ VECTOR DIFFERENTIAL OPERATORS 


2.2.1 ■ Gradient of a Scalar Function 

The fields of electromagnetism are functions of position x, so there are three spa¬ 
tial partial derivatives: 3/3x, 3/3y, and 3/3z. The gradient operator V, called 
“del,” combines the three partial derivatives into a vector operator. The action of 
V on a scalar function /(x) produces a vector function V/. Using a Cartesian 
coordinate system V / is 

~df ~df .3/ 

V/= i/ + (2.34) 

3x dy dz 

To prove that V/ is a vector, i.e., that it satisfies the definition (2.10), we must 
determine how its components transform under a rotation of the Cartesian axes. 
The (th component is (V/) ( - = 3//3x,-. The transformed component is 


( V 7) ; 


Bf_ = 3f_ 

3 x'j r—t dx ■ 3x ; ’ 

1 ;=l 1 1 


(2.35) 


where the second equality is by the chain rule of differentiation. Now, the trans¬ 
formation (2.9) of the coordinates implies Xj = ( R~ l )jkx ' k , with a sum over k 
implied by the Einstein summation convention. Therefore, 




(2.36) 


where Rjj is the ij component of the matrix R in (2.9), and we have used the 
fact that R~ l = R 1 for any rotation matrix. Thus V/ transforms as a vector: 
(V7); = Rij (V /) j (sum over j implied). 

Equation (2.34) is the formula for calculating V/ in a Cartesian coordinate 
system. But in field theory it is often more useful to use curvilinear coordinates, 
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like cylindrical or spherical polar coordinates. The generalization of (2.34) to 
curvilinear coordinates is nontrivial, as we shall see. So we need a coordinate- 
independent definition of the gradient. Consider the change of /(x) resulting from 
an infinitesimal displacement dx. That is, dx is the vector 

dx = i dx + j dy + k dz. (2.37) 

The change of / from x to x + dx is 

df = /(x + dx) - f{x)=dx d f+dy d f+dz d 4- = dx ■ V/. (2.38) 

dx dy dz 

The third expression in (2.38) is Cartesian, but the fourth form is coordinate inde¬ 
pendent. Therefore the differential relation df = dx ■ V/ is the definition of V/. 
We shall use this definition later to express V/ in curvilinear coordinates. 

An important theorem is that the direction of V/ at a point x is perpendicular 
to the surface of constant / that includes the point x. To prove the theorem, note 
that for any displacement dx along the surface of constant /, df is 0. But then 
by (2.38) V/ is orthogonal to dx. Since dx is an arbitrary displacement on the 
surface, V/ must point in the direction perpendicular to the surface. 

For an arbitrary displacement dx the change of / is df = |V/||dx|cos0 
where 6 is the angle between V/ and dx. The change of / is maximum if 6 = 0. 
In other words, V/ points in the direction of the maximum increase of /(x). The 
magnitude of V/ is the rate of change of / in that direction. 


2.2.2 ■ Divergence of a Vector Function 

The operator V acts algebraically as a vector. The divergence of a vector function 
F(x), denoted V • F and read as “del dot F”, is a scalar function. In Cartesian 
coordinates the divergence is given by 6 


V F = 


dF x dFy dF, 
dx dy dz 


(2.39) 


It can be shown that V • F is invariant under rotation of the coordinate system. 

Again, we need a coordinate-independent definition of the divergence, that can 
be used to derive V • F in curvilinear coordinates. The definition follows from 
the “divergence” property of this operator: V • F is a measure of how the vector 
function F(x) diverges, i.e., spreads out from x. Let d V be an infinitesimal cubic 
volume centered at x, of size e x e x e, aligned with the Cartesian directions i , j , 
and k as shown in Fig. 2.5. The flux of F outward through the boundary surface 
dS of d V is the sum of six terms, from the six faces of the cube. The area of each 
face is e 2 ; and for opposite faces the outward normals are +e, and —e,-, where ei, 

/V /V « « « 

e 2 , e 3 denote the Cartesian directions i , j , k. Thus, treating e as an infinitesimal 
length, the flux through dS is 

6 In suffix notation, V • F = dFi/dxi. 
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FIGURE 2.5 An infinitesimal cube. We consider the outward flux of a vector field to 
derive a coordinate-independent definition of divergence. 


(f) F • dA = [F, (x + ee,7 2) - F, (x - ee,/2)l e 2 

JdS i=1 

i^ = (rTK>. (140) 

f~l dx ‘ 

Comparing the last and first expressions, equation (2.40) says that the divergence 
is equal to the flux per unit volume through an infinitesimal closed surface. This 
statement serves as a coordinate-independent definition of the divergence; letting 
S be the boundary of V, 


V • F = lim — ({>¥■ dA. (2.41) 

v-o VJs 

Another important differential operator is the Laplacian V 2 , read as “del 
squared”. Its action on a scalar function /(x) is defined by 

v 2 / = V • (V/). (2.42) 


In words, the Laplacian is the divergence of the gradient. It is a straightforward 
exercise to show that in Cartesian coordinates 


V 2 / 


. = a 2 / | d 2 f | d 2 f 

dx 2 dy 2 dz 2 


(2.43) 


2.2.3 ■ Curl of a Vector Function 

If V • F is a scalar, then naturally V x F (“del cross F”) is a vector. It is called 
the curl of F. In Cartesian coordinates the curl may be written as a determinant 
operator, 
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i ] k 

V x F = d/dx d/dy a/9z . (2.44) 

Fx F y F z 

Or, equivalently, in terms of the Levi-Civita tensor, the / th component of V x F is 

(VxF),=€,3 (2.45) 

dXj 

(where the sum over j and k from 1 to 3 is implied ). 7 

The curl is a measure of vorticity, i.e., how the vector function curls around 
the point x. Let dS be an infinitesimal square of size e x e centered at x, aligned 
with the Cartesian directions e, and ej. (Remember, ei, £ 2 , £3 stand for i, j, k.) 
The line integral of F(x), counterclockwise around the perimeter dP(i j) of the 
square, illustrated in Fig. 2.6, is 

® F • d£ = e Fi (x — eey /2) + e Fj (x + ee, /2) 

JdP(ij) 

-€Fi(x + eej/2) - eFj(x - ee,/ 2 ) 

/a Fj dFi\ , , 

= \T J r-T ± ) € ^ = ( VxF >^ 2 (2.46) 

y OXi OX j J 

where (i jk) is a cyclic permutation of (123). The four terms in the second expres¬ 
sion come from the four sides of the square, starting at the bottom and proceeding 
counterclockwise, and again e is treated as an infinitesimal. Comparing the last 
and fi rst expressions, (2.46) says that the curl of F is equal to the circulation of F 


j 



FIGURE 2.6 An infinitesimal square. We consider the counterclockwise circulation of 
a vector field to derive a coordinate-independent definition of curl. 

7 The student should beware that (2.44) and (2.45) are not correct for cylindrical or spherical coordi¬ 
nates. The correct expressions for curvilinear coordinates are derived in Sec. 2.4. 
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per unit area around an infinitesimal loop. (By the circulation we mean <f F • di.) 
This property provides a coordinate-independent definition of the curl. To define 
the component of V x F in a specified direction n, let C denote a planar closed 
curve with normal vector n and area A; then 

n ■ (V x F) = lim - £ F • di. (2.47) 

*-o A J c 

As an example of the direction of V x F, if the curve C in (2.47) lies in the xy 
plane and is traversed counterclockwise, then the left-hand side of (2.47) is the 
+z component of the curl. 


EXAMPLE 2 Consider the scalar function /(x) = Cz. Calculate V/ and interpret 
the result geometrically. 

The gradient is V/ = k df/dz = Ck, a uniform vector field independent of x. 
In general the gradient points in the direction of the maximum increase of the 
function, which in this case is the z direction. The magnitude of the gradient is 
the slope of the function in that direction, which in this case is the constant C. 


EXAMPLE 3 Consider the vector function F(x) = Cxi. The only nonzero com¬ 
ponent of F is F x = Cx. Calculate the divergence V • F and interpret the result 
geometrically. 

Figure 2.7 shows the vectors at a few points in the xy plane, assuming C > 0. 
The divergence at an arbitrary point x is V • F = dF x /dx = C. In general V • F 
describes the net flux of F away from x. In Fig. 2.7 the net flux is the same at all 
points, and positive. For example, on a small surface surrounding x in the region 
x > 0, the flow away from x on the right side is greater than the flow toward x on 


y 



FIGURE 2.7 Example 3. A few vectors of the vector function F(x) = Cxi are shown, 
illustrating a vector function with nonzero divergence. 
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the left. For x in the region x < 0 the divergence is also positive because there is 
a flux away from x. 


EXAMPLE 4 Consider the vector function G(x) = Cx j . The only nonzero com¬ 
ponent of G is G y = Cx. Calculate the curl V x G and interpret the result geo¬ 
metrically. 


y 



FIGURE 2.8 Example 4. A few vectors of the vector function G(x) = Cx j are shown, 
illustrating a vector function with nonzero curl. 


Figure 2.8 shows the vectors at a few points in the xy plane, assuming C > 0. 
The curl at an arbitrary point x is V x G = k dG y /dx = C k. In general V x G 
describes the circulation of G around x. In Fig. 2.8 there is a counterclockwise 
circulation around the z axis, independent of x. For example, consider a small 
loop in the region x > 0. The flow in the +y direction on the right side of the 
loop is greater than flow in the +y direction on the left side, so there is a net 
circulation around the loop. 


2.2.4 ■ Del Identities 

We often encounter V acting on products of functions. Also, an expression may 
contain multiple derivatives. It is important to derive some general identities for 
expressions of this kind, so that when such an expression arises in a calculation 
it can immediately be simplified by substitution from the general rule. Table 2.2 
lists some useful identities involving V. 

The identities in Table 2.2 can be derived by expanding them in Cartesian 
components, or more compactly using suffix notation. For example, consider the 
first identity in the table. In Cartesian components it is 
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TABLE 2.2 Del identities. Here / and g denote scalar 
functions of x, and F and G denote vector functions of x. 


Derivatives of products: 

V(/g) 

= /Vg + gV/ 

V • (gV) 

= gV•F + Vg•F 

V x (gF) 

= gV x F + Vg x F 

V-(FxG) 

= (V x F) • G - F • (V x G) 

V x (F x G) 

= (G ■ V)F - (F • V)G + F(V ■ G) - G(V • F) 

Products of derivatives: 

V x (V/) 

= 0 

V • (V x F) 

= 0 

V x (V x F) 

= V(V • F) - V 2 F 


V (fg) = \ ( fg) + j ^ (fg) + k~ ( fg) 

dx dy dz 


= f {'k + i ^ + 

= /V* + g\f. 


7 dg 

dy 


%)+'(%«) 


More compactly, in suffix notation V/ is df/dxi, so the identity is 

d(fg) dg df 

= f^T + ( 2 - 48 > 

dxj dxi dXi 

but this is just the Leibniz rule for differentiating a product. 

Or, consider the second identity. Writing both sides of the identity in suffix 
notation, the identity is 


d(gF,) dFi dg 

J f-^ = s T ^ + ;r Lf '- ( 2 - 49 ) 

oXf dxj dxj 

In each term i is summed from 1 to 3 in accord with the summation convention. 

The equation is again just the Leibniz rule, so the identity is proven . 8 These two 
quick derivations illustrate the power of the suffix notation. 

As an example involving two derivatives, consider V x (V/). In suffix notation 
this vector is 


(V X V/)i = € ijk 


9 2 / 

dXjdXk 


8 As an exercise the reader should prove the identity again by writing out the Cartesian components 
explicitly. 
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The factor 3 2 / /dxjdx k is symmetric under exchange of j and k, whereas the 
factor €jjf. is antisymmetric, and this implies that the right-hand side is 0: Inter¬ 
changing jk in the first step below produces a minus sign from e,j k - But j and k 
are dummy indices, summed from 1 to 3; so renaming the dummy indices ( k —> j 
and j —> k) in the second step we have 

.. 9 2 / = a 2 / = 9 2 / 

l ^ k dxjdxk dxkdxj dxjdxk 


That is, the left-hand side is equal to its negative, so it must be 0. (See also Ex¬ 
ercise 4.) Hence the identity V x V/ = 0. In words, the curl of the gradient of 
a scalar function is identically 0. The identity is true no matter what the function 
/(x) is. Similarly, consider V • (V x F). In suffix notation this scalar is 


V • (V x F) = e ijk 


d 2 Fk . 

dxj 3 xj ’ 


again we have a suffix sum of a symmetric and an antisymmetric tensor, which 
is 0, 9 so V • (V x F) =0. The divergence of the curl of a vector function is 
identically zero. This and the previous identity are important in electromagnetic 
theory, because they lead to the introduction of scalar and vector potentials for the 
electric and magnetic fields. 

We have used suffix notation, the Einstein summation convention, and the 
Levi-Civita tensor to derive useful identities, but these techniques will not be 
needed for many calculations in this book, because we can simply apply the rele¬ 
vant identity from Table 2.2 when needed. 


2.3 ■ INTEGRAL THEOREMS 

The purpose of this section is to describe two theorems involving integrals in 
vector calculus—Gauss’s theorem and Stokes’s theorem—that are very important 
in electromagnetic field theory. 

In our study of electromagnetism we will frequently encounter surface inte¬ 
grals and line integrals , 10 i.e., integrals of the form f s fdA and f c fdt where S 
is a surface with area element dA and C a curve with length element dt. The flux 
of a vector field F through S is the surface integral of n • F(x) where n is the unit 
normal vector at the point x on S. Positive flux means F points through S in the 
sense of n. The circulation of F around a loop C is the line integral of t • F(x) 
where t is the unit tangent vector at the point x on C. Positive circulation means 
F loops around C in the sense of t. 

9 See Exercise 4. 

1 References [ 1 ] and [2] at the end of this chapter, and any of the introductory physics textbooks listed 
at the end of Chap. 1, have good discussions on surface integrals and line integrals. 
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2.3.1 ■ Gauss's Theorem 

Gauss’s theorem, also called the divergence theorem, states that the flux of a vec¬ 
tor quantity outward through a closed surface S is equal to the integral of the 
divergence of the function in the enclosed volume V, 

<j> F ■ dA = J V F d 3 x. (2.50) 

dA is a directed area element on S', for an infinitesimal patch on S the direction 
of dA is normal to S and the magnitude is the area dA. Also, d 3 x is a volume 
element in V. 11 The “O-integral” notation rf s indicates that the integration region 
S is a closed surface, i.e., a surface with no boundary curve, like a sphere or cube. 

We have already seen an example of Gauss’s theorem, for an infinitesimal sur¬ 
face, in (2.40). The mathematical proof for an arbitrary surface can be found in 
math books, such as Refs. [1] and [2]. The basic idea of the proof is this: Sub¬ 
divide the volume V into infinitesimal volume elements. Apply (2.40) to each 
subvolume, i.e., convert the integral of V • F in the subvolume to the surface flux 
of F through the boundary of the subvolume. Then note that for each subvolume- 
boundary-surface in the interior of V there are two flux integrals, which cancel in 
the total integral; because each interior surface belongs to the boundary of two ad¬ 
jacent subvolumes, with the flux in opposite directions from the two subvolumes. 
This point is illustrated in Fig. 2.9. All that is left after canceling the internal 
fluxes is the flux through the outer surface S; hence (2.50). 

Gauss’s theorem is reminiscent of the fundamental theorem of calculus, 



FIGURE 2.9 Illustration of the proof of Gauss’s theorem. A volume V is subdivided 
into very small subvolumes. The integral of V • F in V is the sum of integrals in the 
subvolumes. Each subvolume integral is equal to the flux of F outward through its boundary 
surface. The arrows indicate outward normals dA. The flux integrals over interior surfaces 
cancel in pairs, leaving just the flux outward through the boundary surface S. 

11 We could use dV to denote the volume element, but that is potentially confusing because V also 
denotes the potential ! The notation dfx emphasizes that it is a 3D element. 
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for any function g(x ). In words, the integral of the derivative is determined by the 
function at the boundary points of the range of integration. Similarly in (2.50) the 
integral of the divergence is determined by the function at the boundary S of the 
region of integration V. But in (2.50) the region of integration is a volume and the 
boundary is a closed surface, over which F • n is integrated. 

Gauss’s theorem may appear abstract at first sight and this discussion of it has 
been quite brief. But it will become more familiar in Chapter 3 where we will use 
it often. We’ll also gain good physical insights into its meaning as we apply it to 
electric and magnetic systems throughout this book. 

2.3.2 ■ Stokes's Theorem 

Stokes’s theorem states that the circulation of a vector function around a closed 
curve C is equal to the flux of vorticity 12 through any surface S bounded by C, 

Fdl = J (VxF)-dA. (2.51) 

dl is a length element of C tangent to C; that is, dl = id l where t is the unit 
tangent vector. Also, d A is a directed area element normal to S\ that is, dA = hdA 
where n is the unit normal vector. The direction of dA is coordinated with that 
of dl by the right-hand rule: With the fingers curling in the direction of dl, the 
thumb points in the direction of dA. The “O-integral” notation indicates that C is 
a closed curve, i.e., a loop. However, the surface integral is over an open surface 
S whose boundary curve is C. 

We have already seen an example of Stokes’s theorem, for an infinitesimal 
loop, in (2.46). The proof for an arbitrary curve can be found in Refs. [1] and [2]. 
The basic idea is this: Subdivide the surface S into infinitesimal area elements. 
Apply (2.46) to each subarea, i.e., convert the flux integral of V x F through 
the subarea to the line integral of F around the boundary of the subarea. Then 
note that the line integrals along the subarea boundary segments in the interior 
of S cancel in pairs, because each interior segment belongs to the boundary of 
two adjacent subareas, with the line integral in opposite directions for the two 
subareas. This point is illustrated in Fig. 2.10. All that is left after canceling the 
interior line integrals is the line integral around the boundary curve C; hence 
(2.51). It is obvious from the proof that S in (2.51) can be any surface bounded 
by C. 13 


EXAMPLE 5 Consider the vector function F(x) = x. Calculate the divergence 
and curl of F(x), the flux through a sphere centered at the origin, and the circu¬ 
lation around any circle on the sphere. Verify that Gauss’s theorem and Stokes’s 
theorem are obeyed. 

12 “Vorticity” in this context is just another word for V x F. 

13 See Exercise 11. 
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FIGURE 2.10 Illustration of the proof of Stokes’s theorem. C is a closed curve. A 
surface S bounded by C is subdivided into very small subareas. The flux of V x F over 5 is 
the sum of integrals over the subareas. Each subarea integral is equal to the circulation of 
F around its boundary curve. The arrows indicate the tangent vectors dl. The line integrals 
of F along interior segments cancel in pairs, leaving just the circulation around C. 



FIGURE 2.11 Example 5. Tangent curves of the vector function F(x) = x. 


Using Cartesian coordinates, it is straightforward to show that V • F = 3 and 
V x F = 0. Figure 2.11 shows the tangent curves 14 of F(x), which are radial 
straight lines. The curves diverge, that is, they spread out from any point x; but 
there is no curl, that is, vorticity, at any x. The flux of F through a sphere of radius 
r centered at the origin is f F • dA = r A = 4n r 3 where A is the surface area of 
the sphere. This result agrees with Gauss’s theorem, because f V • F d^x = 3 V = 
4rrr 3 where V is the volume of the sphere. The circulation of F around any circle 
on the sphere is 0, in agreement with Stokes’s theorem, because F is everywhere 
orthogonal to the circle, so F • d£ = 0. 

14 The tangent curves of a vector function are a family of curves everywhere tangent to the vectors. 
For example, the tangent curves of the electric field are the electric field lines. 
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FIGURE 2.12 Example 6. Tangent curves of the vector function G(x) = k x x. 


EXAMPLE 6 Consider the vector function G(x) = k x x. In cylindrical coor¬ 
dinates, G(x) = (pr where r = yjx 1 + y*. Calculate the curl and divergence, 
the circulation around a circle parallel to the xy plane centered at the z axis, and 
the flux through a sphere centered at the origin. Verify that Stokes’s theorem and 
Gauss’s theorem are obeyed. 

Using Cartesian coordinates, in which G(x) = — y i +x j, it is straightforward 
to show that V x G = 2k and V • G = 0. Figure 2.12 shows the tangent curves 
of G(x), which are circles around the z axis. There is vorticity but no divergence 
at any point. The circulation of G around a circle of radius r is § G • d£ = rC = 
Inr 1 where C is the circumference of the circle. This result agrees with Stokes’s 
theorem, because J V x G • d A = 2A = 7nr 2 where A is the area inside the 
circle. The flux of G through a sphere centered at the origin is 0, in agreement with 
Gauss’s theorem, because G is everywhere tangent to the sphere, so G • dA = 0. 


2.3.3 ■ Vector Calculus in Fluid Mechanics 

Vector calculus is the language of electromagnetic field theory. This mathematics 
is also used in other areas of theoretical physics, notably fluid mechanics. The 
basic parameters for fluid motion are density, pressure, and velocity. The density 
p(x, t) and pressure p(x, t) are scalar functions. The fluid velocity v(x, t) is a 
vector function, that depends on position within the fluid. The analysis of fluid 
motion relies on vector calculus. 

The divergence and curl operators describe aspects of the fluid motion. For 
example, conservation of mass, which holds at every point in the fluid, implies 
the equation 


— + V • (pv) = 0, 


(2.52) 
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which is called the continuity equation. The divergence of p\ at x is the outward 
mass flux per unit volume there, by the definition (2.41) of divergence; therefore 
V • (pv) must equal —3 p/dt because mass is conserved. Or, applying Gauss’s 
theorem to (2.52), the mass flux outward through any closed surface S equals the 
rate of decrease of mass in the enclosed volume V: 

(fp\-dA = — — f pd 3 x. 

Js dt Jv 

As another example, the vorticity at a point is defined by V x v(x). If there 
is a whirlpool, i.e., the fluid swirls around some point, then f c v ■ d£ is nonzero 
if C surrounds the point. By Stokes’s theorem, the vorticity is nonzero in the 
neighborhood of the whirlpool; v curls around a point of vorticity. 

When studying electromagnetism, we will develop a geometric intuition for the 
direction and magnitude of fields. Understanding the vector differential operators, 
V- and V x , and their relation to flux and circulation integrals, is a necessary step 
in that direction. It may sometimes help to picture E or B as a “flow” (even if 
the fields are static). The divergence (V-) and curl (V x ) can be visualized in fluid 
flow which may help us to comprehend their meaning in field theory. 


2.4 ■ CURVILINEAR COORDINATES 

In problems with cylindrical or spherical symmetry it is more convenient to use 
appropriate curvilinear coordinates than Cartesian coordinates. Therefore, we will 
derive the formulas for calculating gradient, divergence, and curl in curvilinear 
coordinates. It is a common error to apply a Cartesian equation, (2.34), (2.39), 
(2.43), or (2.44), for cylindrical or spherical coordinates. As we shall see, these 
equations must be modified for non-Cartesian coordinates. 

2.4.1 ■ General Derivations 

In general terms, let u \, U 2 , «3 denote three coordinates (distances or angles) that 
specify the points in three dimensions. The corresponding unit vectors ei, £2, £3, 
point in the directions of independent positive displacements of mi, M2, «3, respec¬ 
tively. These unit vectors are assumed to form an orthogonal triad at each point in 
space. The infinitesimal displacement ds in space that results from changing the 
coordinates by du \, dii 2 , du 3 can always be written in the form 

ds = t\h\du\ + t 2 hidu 2 + ezhidui. (2.53) 

Here h 1 , /12, A 3 are scale factors that relate distance ( dsj ) to change of coordinate 
(diti ); in other words /;, is the ratio dsj/dut, for i = 1 , 2, 3. The scale factors are, 
in general, functions of position (m 1 , U 2 , M3). Also, the scale factor for a distance 
coordinate is dimensionless, while that for an angle coordinate has units of length. 

The simplest case is Cartesian coordinates, for which u\ = x, U 2 = y, and 
M3 = z. The scale factors are h 1 = I12 = /13 = 1 in this case. But next we will 
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derive equations for grad, div and curl in general coordinates. And then we will 
write them specifically for cylindrical and spherical coordinates. 


Gradient 

The gradient of a scalar function / is defined by the differential relation df = 
V f ■ ds. Viewing / as a function of (wi, U 2 , n 3 ), we have 


df = -—du i + -— du 2 + -—du 3 , 
dU i dl/2 dl/3 


(2.54) 


and 


V/ • ds = (yf)i h\du\ + (V /) 2 fi 2 dii 2 + (V /) 3 h^du^. (2.55) 


Equating the coefficients of the independent differentials du\,du 2 , du 3 , we find 
that the component of V/ along e, is 


(V/)« = 


J_ 3/ 

hi But 


(2.56) 


(There is no sum over i here; this is the ith component of the vector.) Note that 
this result differs from the Cartesian formula (2.34) unless h\ = /12 = /r 3 = 1, 
which is the condition for Cartesian coordinates. 


Divergence 

To derive the formula for the divergence of a vector function F(«i, « 2 - n 3 ), we 
apply the definition (2.41) to the infinitesimal cubic volume defined by displace¬ 
ments du\,du 2 , du?, (see Fig. 2.13): 



FIGURE 2.13 An infinitesimal curvilinear volume. The volume integral o f V ■ F i s the 
total outward flux of F through the six faces, ds 1 , ds2, and <A 3 are the elements of length 
in the three orthogonal directions. 
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V F dV = [ F\h2h 3 \ u]+du] - F\h2hs\u^]du2dui 

+ [F 2 hT,h l \ U2+dll2 - F 2 h2,h\\ U2 ]du2,du\ 

+ [F3h\h 2 \ U3+du3 - F 3 h]h 2 \ U3 ]du\du 2 (2.57) 

where d V is the infinitesimal volume 


dV = (h\dii\) (hidui) (h 3 du 3 ). (2.58) 


The right-hand side of (2.57) is the outward flux through the boundary surface 
of dV, consisting of 6 terms from the 6 faces of the cube. F\, F 2 , Ft, are the 
projections of F onto the directions ei, £2, £3. For example, the flux of F through 
a face with normal ei is ±F\ (hidu 2 )(h 3 du 3 ), with whichever sign corresponds 
to outward flux. Adding up the six fluxes gives (2.57); this equation is illustrated 
in Fig. 2.13. Dividing out the differentials du\,du 2 , du 3 , we find 


V F = 


1 


h\h 2 h 3 


o 0 0 

— (F\h 2 h 3 ) + — (F 2 h 3 h\) + — (F 3 h\h 2 ) 
du 1 du 2 du 3 


(2.59) 


Again, this differs from the Cartesian formula (2.39) in general. Recall that 
h 1, /12, h 3 can be functions of position. For example, ^2/23 may have a different 
value at ni + du\ than at nj. The products of scale factors must be included in 
the derivatives in (2.59). 


Laplacian 

The Laplacian operator V 2 , which we will encounter in electrostatics, is defined 
by the relation V 2 / = V • (V/). Combining (2.56) and (2.59) we have for gen¬ 
eral curvilinear coordinates 


V 2 / = 


1 9 / hjh/c 9/ \ 

h\h2h 3 ^dui\ hj dui) 


(2.60) 


where the sum is overthe three cyclic permutations of (123). 


Curl 

Curl, being a vector operator, is a bit tricky. To derive the formula for the curl of a 
vector function F(«i, «2, u 3 ), let’s start by applying (2.47), which is the infinites¬ 
imal version of Stokes’s theorem, to the infinitesimal rectangular area produced 
by displacements du\ and du 3 (see Fig. 2.14): 

(V x F) 3 (h\du\) (h 2 du 2 ) = (^2^2l« 1 +d«, - F 2 h 2 \ U] )du 2 

- {F\h\\ U2+du2 - F\h\ | H2 ) du\. (2.61) 

The right-hand side is the circulation integral, consisting of four terms from the 
four sides of the rectangular area, as illustrated in Fig. 2.14. Thus, canceling the 
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FIGURE 2.14 An infinitesimal curvilinear loop. The surface integral of n ■ (V x F) is 
the sum of the four line integrals. 


displacements du\du 2 . 


(V x F) 3 = 


h\h2 


— (^2^2) - T— (Flhi) 
\_aui au 2 


(2.62) 


The other components are similar; replace (123) by (231) or (312). We may ex¬ 
press the curl as a determinant, 


V x F = 


ei//t2^3 *i/h-ih\ e-j/h\h2 
d/dui d/du2 d/duj 
h\F\ hiFi 


(2.63) 


To use (2.63), expand the determinant in terms of minors of the first row, tak¬ 
ing care to use the correct signs. The scale factor and component of F must be 
differentiated together. For example, one term in the expansion is 


ei 3 
/*2^3 3m2 


(*3*3), 


and there are five other terms. 

2.4.2 ■ Cartesian, Cylindrical, and Spherical Coordinates 

Cartesian Coordinates 

Cartesian coordinates are ui = x, 112 = y, and M 3 = z. The scale factors 
h\,h 2 , hi, are all 1. The position vector of the point with coordinates (x, y, z) 
is 


x — x'\ +yj +zk. 

Then the above equations for gradient, divergence, and curl reduce to the Carte¬ 
sian equations. 
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FIGURE 2.15 Cylindrical coordinates r , <f>, z. The unit vectors r, <f>, and k at P point 
in the directions of increasing r, <fi. and z, respectively. 


Cylindrical Coordinates 

Cylindrical coordinates are u\ = r, 112 = </>, and U 3 = z, as defined in Fig. 2.15. 
The formulas for differential operators in cylindrical coordinates are recorded in 
Table 2.3. It is also important to know how to write the position vector x in cylin- 


TABLE 2.3 Cylindrical coordinates. 


position 

x = rr + zk 

displacement 

ds = r dr + (firdcp + k dz 

scale factors 

II 

II 

•e- 

II 

volume element 

dV = r dr d<p dz 

gradient 

V/ = r— + <£-— + k — 
dr r 9</> dz 

divergence 

„ 1 3 1 dF<p d F- 

r dr r d<t> dz 

Laplacian 

V2 f = l*_(r°f\ + L*L + *L 

J rdr\ dr) + r 2 d<p 2 dz 2 

curl 

(V x F) r = ^ 

r 30 dz 


„ dF r dF z 

(VxF) ^i7-^r 

19. , 1 dF r 
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FIGURE 2.16 Spherical polar coordinates r, 0, (p. The unit vectors r, 9, and (f> at P 
point in the directions of increasing r, 9 , and <p, respectively. 


drical coordinates. Looking at Fig. 2.15 we see that 

x = rr + zk. (2.64) 

We may also express ( x , y, z) in (r, <p, z): 

x = rcos<p, y = rsin0, z = z. (2.65) 

Spherical Coordinates 

Spherical polar coordinates are u\ = r, ui = 6, and uj = <p, as defined in 
Fig. 2.16. The formulas for differential operators in spherical coordinates are 
recorded in Table 2.4. Again, it is important to know the position vector in spher¬ 
ical coordinates, 15 

x = r r. (2.66) 

We may also express ( x , y, z) in (r, 0, <p), 

x = r sin0cos0, y = rsin0sin0, z = rcosd. (2.67) 


EXAMPLE 7 The unit basis vectors ?, 6, (p of spherical coordinates are vector 
functions of the position x. For each of these vector functions calculate the diver¬ 
gence and curl, and interpret the results geometrically. 

1. Radial. For the function F(x) = ?, the components are F r = 1, F$ = = 0. 

Using the divergence and curl in spherical coordinates (Table 2.4) we find 

15 Failure to appreciate this little equation is the source of much confusion. A common mistake is to 
write rr + 80 + (p<p, which doesn’t even have consistent units! 
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TABLE 2.4 Spherical coordinates. 


position 

x = rr 

displacement 

ds = r dr + 6rd6 + 0r sin 9dtp 

scale factors 

h r = l,hg = r,hff, = r sin 0 

volume element 

dV = r 2 dr sin 6d8 dtp 

gradient 

V/=r 9/ + «I 9/ + 0 1 9/ 

dr r dd r sin# dtp 

divergence 

„ 13/2 \ 1 3 . 

V ' F - r 2 9r ('-0 + , sl „9 8 «<“" eF »> + 


l 9/y, 
rsin# dtp 


Laplacian 

curl 


2 , 1 a / 2 3 / \ 1 3 / 3 /\ 

<v * F >» = H^i£~s :(rF ' ,) ] 

r] 


F), = i 


3 , „ , 3 Fr 
dr ^ 6 d6 


1 3 2 / 

r 2 sin 2 (9 30 2 


V • r = 2/r and V x f = 0. These results make sense geometrically, because 
the tangent curves of r are radial lines, which diverge (spread out) but have no 
vorticity. 

2. Polar angle. For the function F(x) = 0, the components are Fq = 1 , F r = 

— 0. The divergence and curl of this vector field are V • 6 = (cot 8) / r and 

V x 6 = 0/r. These results make sense geometrically. The tangent curves of 
0 are circles with constant longitude. These curves diverge at all points except 
the equator (8 = tt /2) where V • 0 = 0, and diverge most strongly near the 
poles (8 — 0 or n) where V • 6 = ±oo. Also, they curl around the direction <p 
(draw a picture that shows this!), which explains why V x 6 oc 0. 

3. Azimuthal. Forthe function F(x) = 0, the components are = 1, F r = Fq = 
0. The divergence and curl of this vector field are V • 0 = 0 and V x 0 = 
(f cot0 — G)/r = k/(r sin 0). (The last result follows from the relation k = 
r cos 8 — 6 sin 0, between Cartesian and polar coordinates.) These equations 
make sense geometrically. The tangent curves of 0 are circles with constant 
latitude. The tangent curves of 0 curl around k, but do not diverge at any point. 
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2.5 ■ THE HELMHOLTZ THEOREM 

We end this chapter with a theorem in vector calculus that requires an understand¬ 
ing of the basic concepts defined earlier. 


Preliminaries 

A vector function (or field) F(x) that has zero curl, V x F = 0, is called ir- 
rotational. For instance, the electric field of any collection of static charges is 
irrotational. A vector function G(x) that has zero divergence, V • G = 0, is called 
solenoidal. For instance, any magnetic field is solenoidal. 

Any vector function H(x) can be written as the sum of an irrotational function 
F(x) and a solenoidal function G(x). The functions F and G are not necessarily 
unique. However, in some cases, if suitable boundary conditions are imposed, 
then the decomposition H = F + G is unique. 

An irrotational field (V x F = 0) can be written as a gradient, F = —Vfi. (We 
proved in Sec. 2.2.4 that the curl of Vi/r is 0.) A solenoidal field (V • G = 0) can 
be written as a curl, G = V x A. (We proved in Sec. 2.2.4 that the divergence 
of V x A is 0.) So, the decomposition of an arbitrary vector function H(x) into 
irrotational and solenoidal parts becomes H(x) = — Vi/r + V x A. 

The Helmholtz Theorem 

The preliminaries bring us to the Helmholtz theorem, a general result in vector 
calculus with important applications in theoretical physics. Let H(x) be differen¬ 
tiable at all points in space, with divergence V ■ H = d(x) and curl V x H = c(x). 
If d(x) and c(x) approach 0 faster than r ~ 2 as r —> 0, and H(x) approaches 0, 
then H(x) = — Vi/r + V xA where 


t<*> =J 

r d(x')d 3 x' 

‘ 4tt |x — x'| ’ 

(2.68) 

A(x) =J 

r c(x')d 3 x' 

‘ 4zt |x — X'l' 

(2.69) 


The integration region is all of space. The proof of the theorem is given in Ap¬ 
pendix B, but it requires some techniques that will be developed in Chapter 3. 

The theorem has important applications in electromagnetic field theory, and 
also in other areas of theoretical physics. It implies that if the divergence and 
curl of a vector function (field) are known, then the function can be determined 
uniquely (under the assumptions of the theorem) by (2.68) and (2.69). 


FURTHER READING 

Either of the two books below provide a brief but sufficient introduction to vector 
calculus. 
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1. Paul C. Matthews, Vector Calculus (Springer-Verlag, Berlin, 1998). This book is espe¬ 
cially recommended for learning suffix notation, which is important in advanced areas 
of theoretical physics and useful for deriving vector identities. 

2. H. M. Schey, Div, grad, curl, and all that: an informal text on vector calculus (Norton, 
New York, 1992). 


EXERCISES 

Sec. 2.1. Vector Algebra 

2.1. Determine the angle, in degrees, between a diagonal of a cube and an adjacent edge. 
(Hint: Use the unit cube 0 < x , y, z < 1 ; one diagonal is along i + j + k, and an 
adjacent edge is along k.) 

2.2. (a) Prove that | A x B| is equal to the area of the parallelogram defined by A and B. 

(b) Why is the equation A x B = AB sin 6 false? 

(c) Prove that A ■ (B x C) is equal to the determinant 

Ax Ay A z 

B x By B z 

C X Cy C z 

(d) Show that A • (B x C) is equal to the volume of the parallelepiped defined by 
A, B, and C. From this result prove that A ■ (B x C) = (A x B) ■ C. 

2.3. (a) Prove equation (2.27). (Hint: Convince yourself that (2.27) has 81 components 

in general. However, it is sufficient to consider just those with i = 1 by symme¬ 
try. Then examine various combinations of jtm, and show that the left and right 
hand sides of (2.27) are equal in all cases.) 

(b) Prove the identity 

(A x B) • (C x D) = (A ■ C) (B • D) - (A • D) (B ■ C). 

2.4. Prove thatif Sy = Sjj and Aij = —Aji, then SijAij = 0. The summation conven¬ 
tion is understood. In words, the contraction of a symmetric tensor and an antisym¬ 
metric tensor is 0. (“Contraction” is a bit of jargon from tensors; it means the sum 
over dummy indices.) 

2.5. (a) Write the 3 x 3 rotation matrix R for the rotation by angle 6 about the z axis, 
(b) Verify that the result of (a) is an orthogonal matrix; that is, R~ l = R 1 . 

Sec. 2.2. Vector Differential Operators 

2.6. Prove these identities; 

(a) V • (gF) = gV•F + Vg■F 

(b) V x (gF) = gV x F + Vg x F 

(c) V • (F x G) = G • (V x F) — F • (V x G) 

(d) V x (V x F) = V (V ■ F) - V 2 F. 
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x 

FIGURE 2.17 Exercise 7. A certain scalar function is given in the region bounded by 
the axes and the hyperbola. 


2.7. Consider the function /( x , y) = Cxy in the region bounded by the half-planes 

half plane 1 = xz plane (y = 0) with x > 0, 
half plane 2 = yz plane (x = 0) with y > 0, 

and by the hyperbolic sheet xy = a 2 , as shown in Fig. 2.17. (z is unbounded.) 

(a) Show that f(x, y) satisfies Laplace’s equation, V 2 / = 0. 

(b) Calculate the boundary values of / on the three surfaces. 

(c) Calculate V/, and sketch the tangent curves of the vector function V /. 

2.8. (a) Calculate V<t> where <t>(x) = p • x/r 3 . Here p is a constant vector, and r = |x|. 

(Hint: <t> is fg where / = p • x and g = r~ 3 .) Sketch the tangent curves of the 

vector function V<t>, forp = pk. 

(b) Calculate V x A where A(x) = m x x/r 3 , and m is a constant vector. Sketch 
the tangent curves of V x A, for m = mk. 

This is a hard problem, but important. The tangent curves of V<t> and V x A look the 
same. <t> has the form of the scalar potential of a pointlike electric dipole, and A has 
the form of the vector potential of a pointlike magnetic dipole (apart from constant 
factors). 

Sec. 2.3. Integral Theorems 

2.9. (a) Sketch a picture of the vector field F(x) = kxx. 

(b) Calculate directly the line integral of k x x around a circle in the xy plane, 
centered at 0 with radius a. 

(c) Calculate the line integral by using Stokes’s theorem. 

2.10. (a) Sketch a picture of the vector field F(x) = x. 

(b) Calculate directly the flux of F(x) outward through the surface of the unit cube 
defined by 0 < x, y, z < 1. 

(c) Calculate the flux by using Gauss’s theorem. 
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2.11. (a) Use Gauss’s theorem to prove that 

f (V x F) • d\ = — f (V x F) - dA 

JSi Js 2 

where Si and S 2 are open surfaces with the same boundary curve C. ( dA points 
outward from the volume enclosed by S] (J S 2 .) 

(b) Prove the same result from Stokes’s theorem. 

Sec. 2.4. Curvilinear Coordinates 

2.12. Calculate the gradient V/ and the Laplacian V 2 / of these scalar functions: 

(a) / = x 2 + ;y 2 + z 2 , 

(b) / = (x 2 + y 2 + z 2 ) / , 

(c) f = (x 2 + y 2 + z 2 ) / . 

Do each calculation two ways: using Cartesian coordinates, and using spherical polar 
coordinates. You must get the same answer either way. 

2.13. Calculate the divergence and curl of these vector functions: 

(a) x, (b) x/r, where r = |x|, and (c) k x x. 

Do each calculation two ways: using Cartesian coordinates, and using spherical polar 
coordinates. 

2.14. A useful mathematical skill is to be able to expand the unit vectors of one system of 
coordinates in the unit vectors of another system. For Cartesian coordinates ( x, y, z ) 
and spherical polar coordinates (r, 9, <j >) use Fig. 2.16 to prove: 

(a) 

i = r sin 9 cos <p + 0 cos 9 cos <p — 4> sin <p 
j = r sin 9 sin <f> + 0 cos 9 sin <p + <t> cos <p 
k = r cos# — Osin 9 

(b) 

r = i sin © cos<£ + j sin# sin<£ + kcos# 

0 = i cos 9 cos <p + j cos # sin 0 — k sin # 

<p = — i sin <p + j cos <p 

Sec. 2.5. The Helmholtz Theorem 

2.15. Consider fluid flow in the xy plane directed radially away from the origin, so the 
fluid velocity is v = v r r. [The point (0, 0, 0) is a fluid source.] Suppose that as a 
function of distance r from the origin, v r (r) = Cr p , where C and p are constants. 

(a) For what values of C and p can v be expressed as a gradient, v = V^r? 

(b) Find i>(x) for arbitrary C and p. Treat the case p = — 1 separately. 
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2.16. It is possible to construct vector functions that have both divergence and curl equal 
to 0. Consider 


F(x, y) 


-yj +*j 

x 2 + y 2 


which is defined for the domain T> of all points (x, y) ^ (0, 0). 

(a) Show that V ■ F = 0 and V x F = 0. 

(b) Now calculate ^ F ■ dt for the unit circle in the xy plane. The integral is ^ 0. 
Why is this not a counterexample to Stokes’s theorem? 

2.17. Let H(x) = x 2 y i + y 2 z.j + z 2 xk. Find an irrotational function F(x) and a solenoidal 
function G(x) such that H = F + G. 

2.18. F = — Vi/r is irrotational, G = V x A is solenoidal, and H = F + G. Show that i/r 
and A satisfy the equations 

V 2 Vr = -V H, 

V (V • A) — V 2 A = V x H. 


[The solutions to these equations are (2.68) and (2.69).] 


General Exercises 

2.19. Consider the vector function F(x) = Cx, where C is a constant. 

(a) What is the divergence of F(x)? 

(b) What is the flux of F through a cubic surface of size i x i x l centered at the 
origin? 

2.20. Consider the vector function G(x) = k x x. 

(a) Sketch the tangent curves of G(x). 

(b) What is the circulation <f G • d£ around the unit circle C in the xy plane? 

2.21. A vector function F(x) has the form F(x) = f(r)x, where r = |x|. 

(a) Prove that V x F = 0. 

(b) Now suppose also V ■ F = 0. What is the most general form allowed for f(r)l 

2.22. Consider the scalar function 

h(x) = (x x A) • (x x B) 

where x = xi + yj +zk and A and B are constant vectors. Show that the gradient 
of h(x) is 

V/r = A x (x x B) + B x (x x A). 

Also, verify this general result for the special case A = a i and B = b j . 

2.23. As usual, x = x i + y j + zk and r = |x|. Let A and B be arbitrary constant vectors. 
Prove these two results: 
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WA.v(i)—^ 

(b)B- v[a 

2.24. Consider the vector function F(x) = 4>. Calculate f c F ■ d£ and /// V x F • r/A. 
where C is the circle of radius r in the xy plane centered at the origin, and H is the 
hemisphere above the xy plane with boundary curve C. (Take the integral over C to 
be counterclockwise.) Also, calculate f D V x F • d\ where D is the disk in the xy 
plane bounded by C. Verify Stokes’s theorem in both cases. 

2.25. Consider the vector function 

x r r 

F(X) = (r 2 +e 2)3/2 = (r 2 + ,2)3/2' 

(r is the spherical radial coordinate.) Calculate the divergence of F(x), and sketch a 
plot of the divergence as a function of r, for e « 1, « 1, and » 1. Calculate the flux 
of F outward through a sphere of radius R centered at the origin, and verify that it 
is equal to the integral of the divergence inside the sphere. Show that the flux is 4n, 
independent of R, in the limit e —0. 

The limit e —*■ 0 is very singular! The function V F approaches an infinitesimally 
narrow function of r with volume integral 4 jt . The mathematics of this example is 
important in electromagnetism. In the limit e —*■ 0, F(x) has the form of the electric 
field of a point charge at rest at the origin. 

2.26. Consider the scalar function <t>(x) = C • x/r 3 , where C is a constant vector. In a 
coordinate system with C = Ck, <t> may be written as 

Cz C cos# 

( x 2 + y 2 + z 2 ) 3 / 2 r 2 

using Cartesian and spherical polar coordinates, respectively. 

(a) Calculate — V<t>, which is the vector field E(x). You may use either Cartesian or 
polar coordinates. 

(b) Determine the Cartesian direction of E at a point on the x axis. 

(c) Determine the Cartesian direction of E at a point on the z axis. 

(d) Sketch the tangent curves of E. 

2.27. Consider the vector function F(x) = x/r 3 . (As usual, r is the magnitude of x.) 

(a) Calculate the flux of F through a sphere of radius a, centered at the origin. 

(b) Calculate the flux of F through a planar disk of radius a, parallel to the xy plane 
and centered at the z axis at z = H. 

(c) Determine the numerical value of the result of (b) for H — a. 

2.28. (a) Calculate the divergence of the function 

G(x) = xe~ ar . 


(b) Sketch a graph of V • G as a function of r. 
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(c) Determine the volume integral of V - G inside a sphere of radius a. (Hint: Use 
Gauss’s theorem.) 

(d) What is the volume integral of V • G over the infinite volume? 

2.29. Prove Green’s theorem : If /(x) and g(x) are any two scalar functions, then 

j> s (/Vg - gVf) d\ = (/V 2 g - gV 2 /) d 3 x 

where 5 is a closed surface and V the enclosed volume. 


Computer Exercises 

At the end of most chapters there are a few exercises that require the use of a 
computer. Convenient software for these exercises is Mathematica or Maple, be¬ 
cause these programs integrate analytic calculations, numerical calculations, and 
graphics. However, other programs, e.g., Matlab, Mat head, or old-fashioned pro¬ 
gramming languages like Fortran can be used if Maple and Mathematica are not 
available. Some of the simpler exercises can probably be done with a graphing 
calculator, although that is a relatively limited technology. 

To get started doing computer calculations in field theory, here is an exercise 
in elementary functional analysis. 


2.30. Consider a charged line segment with constant charge per unit length X and length 
2 i, located on the z axis from z = — t to z = +£ . The electrostatic potential through¬ 
out the space around the line segment is the function 


V(x,y,z) 


X z + 1 + y/x 2 + y 2 + (z + f) 2 

In ... . rrr: 

4rreo z — i + f x 2 + y 2 + (z — l) 2 


This function looks pretty complicated, so we use a computer to study it. 

(a) Let l be the unit of length. Plot V (x, 0, 0) in units of A = X/(4.tco) for* in the 
domain (0, 5). 

(b) The electric field is E(x) = — VP. Plot E x (x, 0, 0) in units of A/l for x in the 
domain (0, 5). (If Mathematica or Maple is used, the derivative can be calculated 
analytically by the program. The result may be reduced to a simple form.) 

(c) Make a log-log plot of E x (x, 0, 0) vs x, over a sufficiently large range of x, and 
show that the functional dependence is x ~ 1 for* <3C i and x~ 2 for* 3> £. We 
are often interested in such asymptotic expansions in field theory. 



CHAPTER 



Basic Principles of Electrostatics 


The theory of electromagnetism encompasses all electric, magnetic, and optical 
phenomena. To learn such a far-reaching theory we must start someplace. We 
start with electrostatics, which is conceptually the simplest part of the full theory. 
Many of the theoretical ideas and mathematical techniques that we will acquire in 
studying electrostatics will be used again in the other parts of the theory. 

Everybody who comes to this subject has already had a lot of experience with 
electricity and magnetism, both from applications, such as electric lights, motors, 
telephones, radio, and television, as well as from natural phenomena, such as light, 
lightning, static discharges, and Earth’s magnetic field. Often the motivation for 
studying electromagnetism is a desire to understand the physics underlying these 
experiences. It is therefore natural to expect a book on electromagnetism to start 
with discussions of such familiar devices and phenomena. However, those discus¬ 
sions must be postponed until later. We must start with electrostatics to lay the 
foundation for later understanding of these more complicated things. Devices are 
made by people, so it’s fair to say they are understood; but some natural phenom¬ 
ena are only partly understood and remain the subjects of current research. 

Electrostatics is the science of the interactions between electric charges and the 
electric field in static systems, i.e., systems that do not change in time. Strictly, 
the charges must be at rest, and the fields must be constant. Practically, we also 
use the equations of electrostatics for systems that change slowly in time, a rather 
forgiving approximation called the quasistatic approximation. As we’ll see, this 
approximation permits us to apply electrostatic principles to many electromag¬ 
netic devices and phenomena. 


3.1 ■ COULOMB'S LAW 

All of electrostatics can be derived from two principles, Coulomb’s law and the 
principle of superposition. Coulomb’s law, which is a formula for the force be¬ 
tween two point-like charges, is based directly on experiment. The first accurate 
and detailed measurements of the force between charges were made by Coulomb 
in 1785. From these experiments, with a very precise torsion balance, Coulomb 
discovered that the force is inversely proportional to the square of the distance, 
and in the direction of the line joining the charges. Also, the force is proportional 
to the product of the charges. Writing an equation that expresses these facts, the 
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force on a charge q\ due to a charge <72 is 


Fi 



(3.1) 


where r is the distance between the charges and f is the unit vector pointing from 
<72 to q\. If we let r be the vector from <72 to q\, then r = |r| and ? = r / r. For q\ 
and <72 with the same sign the force is repulsive, in the direction of ?; for charges 
of opposite signs the force is attractive. 

It is useful to introduce another notation for the force equation, that displays 
explicitly the two positions. Let xi and X 2 denote the positions of <yi and qi with 
respect to some origin in space, as shown in Fig. 3.1. Then the relative vector r 
from <72 to q\ is r = xi — X 2 , and the force on q\ is 


„ Xi — X2 

Fj = Kqm—^ -%. (3.2) 

|xi -x 2 | J 

The constant of proportionality K depends on the system of units. In this book 
we use Standard International (SI) units, also known as rationalized MKSA units 
(for meter, kilogram, second, Ampere). The table below lists the units of some 
important quantities . 1 


Quantity 

Unit 

Abbreviation 

length 

meter 

m 

mass 

kilogram 

kg 

time 

second 

s 

current 

ampere 

A 

force 

newton 

N 

charge 

coulomb 

C 

potential 

volt 

V 

magnetic field 

tesla 

T 


In SI units the value of K in (3.2) is defined to be 


K = - 

47reo 


10“ 7 c 2 


Ns 2 


= 8.99 x l(f 


Nm 2 

"C 2 ~ 


(3.3) 


where c is the speed of light and eo is called the permittivity of the vacuum . 2 
The unit of charge, the coulomb (C), is defined by 1 C = 1 A s, where the unit of 
current, the ampere (A), is the fundamental electric unit. The ampere is defined 
in terms of the magnetic force between current-carrying wires, as we’ll learn in 

1 Other systems of electric and magnetic units exist, and the field equations depend on the choice 
of units. When reading the literature it is necessary to identify which system of units is being used. 
Appendix A compares SI and Gaussian units. 

2 In the rest of the book we will never use K , but always write l/4jreo explicitly. 
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FIGURE 3.1 Interacting charges. The forces are equal but opposite. For the force di¬ 
rections shown, q\ and <J 2 have the same sign. 


Chapter 8. The coulomb is defined such that l/(4;reo) has the value in (3.3). The 
numerical value is the force in newtons between two charges each of 1 C that are 
1 m apart. 

Coulomb’s law (3.1), the first principle of electrostatics, is an experimental 
fact. 


3.1.1 ■ The Superposition Principle 

The superposition principle merely states that the force on a charge q due to a set 
of charges [q\,q 2 ,q-i ,..., <y,v) is the sum of the individual Coulomb forces 


N 


f* = £ 

k =1 


qqic x-x k 
4ire 0 |x - x,t| 3 


(3.4) 


where x is the position of q, and x £ that of q^. 


3.2 ■ THE ELECTRIC FIELD 


3.2.1 ■ Definition 


The electric field E(x) is a vector-valued function of position. That is, there is a 
field vector at every point in space. 

The definition of E(x) is the force per unit charge that would be exerted on a 
small “test charge” q if it were located at x, in the limit q —> 0 


E(x) = lim -. 
<7-*o q 


(3.5) 


The test charge is taken to be small so that it does not affect the other charges. 
Because the force F on q is proportional to q, the electric field is independent of 
the test charge. This definition provides a technique in principle for measuring the 
defined quantity E(x): Take a small charge to x, and measure F/q. 
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The subject of this chapter is the interaction between charges and the field. 
The definition (3.5) tells us one aspect of the interaction: The electric field exerts 
a force on a charged particle q, given by 

F 9 = r?E(x) (3.6) 

where x is the position of q. This equation states how the field affects a charge. 
Next we describe the other aspect of the interaction—how charges affect the field. 


3.2.2 ■ Charge as the Source of E 

We regard E(x) as a real physical entity, not just a mathematical construction. 
From the way it is defined above one might think that E is just a mathematical 
technique for describing the electric force, but that is the wrong way to think of 
it. We know that light consists of fluctuating electric and magnetic fields, and 
no one would dispute the statement that light is a real physical entity, not just a 
mathematical construction! Field theory was developed by Faraday and Maxwell 
as an alternative to “action at a distance.” The field is something real that extends 
throughout a volume of space and exerts forces on charges in the space. Now the 
question is, where does this entity, the field, come from? What is its sourcel 
Electric charge is one source of electric field. In electrostatics, charge is the 
only source. 3 Comparing (3.5) and (3.4) shows immediately that the field created 
by a set of static point charges [q \, q 2 , ■ ■ ■, gyv) is 


N 

E(*) = £ 


k =1 


qk x - x k 
47re 0 |x-x*| 3 ' 


(3.7) 


We refer to x as the field point and to the Xk’s as source points. The vector 
(x — X/t)/|x — x*| is the unit vector in the direction from the source point toward 
the field point. 

The field as a function of position x, due to a single point charge q at position 
x q , is 


E(x) = 


q r 

4;reo r 2 


(3-8) 


where r = x — x q . Charge acting as a source of E is the other aspect of the 
interaction between charges and the electric field. Equation (3.8) states how a 
charge affects the field, and (3.6) states how the field affects a charge. 


EXAMPLE 1 Figure 3.2 shows six identical charges, one at each vertex of a reg¬ 
ular hexagon in the xy plane centered at the origin. 

3 In Chapter 10 we will learn that another source of E is a magnetic field that changes with time. But 
that is a dynamic, rather than static, phenomenon. 
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FIGURE 3.2 Example 1. Six charges at the comers of a regular hexagon. 


1. What is the electric field at any field point x in the xy plane? 

Let the charges be numbered with the index k from 1 to 6. Then the position 
of the &th charge is 


x 


/ 

k 


t kn ? . kn 

la cos-h l a sin —. 

3 3 


(3.9) 


The field for arbitrary x is then obtained from (3.7) as 


E(x) = 


(x — a cos kn/3) i + (y — a sin&7r/3) j 
4?re 0 [(x _ a coskn/3) 2 + (y — a sin^/3) 2 ] 3 ^ 2 


E 


(3.10) 


where a is the distance from the origin to any of the charges. The field is 
singular, i.e., goes to infinity, at the six charges, but is finite everywhere else. 
2. What is the field on the x axis? 

We set y = 0 in (3.10) and obtain 


£,U,0) 

Ey (x , 0) 


E 


(x — a cos^^/3) 


k=i [ x 2 — 2ax coskn/3 + a 2 ] 


21 3 / 2 ’ 


—qa 


E 


sin^^/3 


[x 2 — 2ax cos kx/3 + a 2 ] 


21 3 / 2 


0 . 


(3.11) 

(3.12) 


It is clear that E y (x ., 0) must be zero, because of the symmetric positions of 
the charges; in the sum in (3.12) the terms with k = 3 and 6 are 0, and the 
other terms cancel in pairs. 

At the origin, E is 0, which is easily seen from the equations or just physi¬ 
cally because the fields due to charges on opposite vertices cancel. 
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3. As an example of asymptotic approximation , what is the field on the x axis far 
from the origin, accurate through order x -4 ? 

The y component is zero on the x axis. For x a, the x component must 
be calculated by expanding (3.11) in a power series in the small quantity a/x, 
using the method of Taylor series. The result is 


E x (x,0) 


I I , 9( l a2 

4ti€o I x 2 2 x 4 


(3.13) 


The leading term is, as expected, the same as for a point charge 6 q at the origin. 

4. If the 6th charge is removed from the group, what is the field at the origin for 
the remaining 5 charges (k = 1,2,..., 5)? 

We could start from (3.11) but sum only from k = 1 to 5. However, the 
superposition principle enables us to answer the question more easily if we 
realize that the field of charges 1 to 5 is the same as the field of charges 1 to 
6 superposed on the field of a charge — q at (a, 0). But the field at the origin 
due to the original 6 charges is 0, so the field of charges 1 to 5 is (note the 
sign!) 


^charges (0> 0) 


+<7i 

47reoa 2 


(3-14) 


3.2.3 ■ Field of a Charge Continuum 

Charge resides in elementary particles—electrons and protons—so any field may 
be attributed to discrete, point-like sources, as in (3.7). However, these micro¬ 
scopic charges are incredibly small, and much smaller than the relevant length 
scales of any macroscopic system. In a macroscopic system it is a good approxi¬ 
mation to replace the discrete elementary charges by a continuous distribution of 
charge. We denote by p(x) the volume charge density (charge per unit volume) as 
a function of position x, defined with respect to volumes that are small compared 
to the whole system, but large compared to the distances between individual par¬ 
ticles. That is, the density p(x') at a source point x' is the total charge in a small 
volume SV' around x', divided by SV'. By a “small volume” we mean small on 
the macroscopic scale; but the number of elementary particles in S V' is very large. 
Making this continuum approximation, the sum in (3.7) may be replaced by an in¬ 
tegral over the volume of the charge distribution 

E(x) = -L- [ r X ~* p(xVV- (3-15) 

4jre 0 J |x-x'| J 

Equation (3.15) is a fundamental equation of electrostatics. Given the charge den¬ 
sity p(x') throughout the source, (3.15) determines the electric field E(x) at the 
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“field point” x. In (3.15) we use the convention that primed coordinates represent 
source points, and unprimed coordinates represent field points. 

We might write the equation (3.15) more succinctly as 



which, however, has the disadvantage that it does not display explicitly the field 
point and the source point. The meaning of ? and r is not manifest, and one must 
remember that r is the distance between the source point and the field point, and 
? the direction vector. 

In (3.15) we consider the charge to be distributed in three dimensions, with 
volume density p (x') at x'. That is, p(x')d 2 x' is a small charge element dq'. If the 
charge is distributed over a 2-dimensional surface, then dq' is o{x')dA' where 
dA' is an area element and er(x') is the surface density, which has units C/m 2 . If 
the charge is distributed over a 1-dimensional curve or line, then dq' is A (x')di', 
where di' is an infinitesimal line segment and A(x') is the linear density, which 
has units C/m. In these latter cases the field E(x) would be expressed as a surface 
integral, or a line integral, over the charge distribution. 

Whether one should use (3.15) or (3.7) depends on the system of interest. To 
describe elementary particles within a molecule (3.7) is appropriate. To describe 
the field of a macroscopic charge distribution, for example a van de Graaff gen¬ 
erator, (3.15) is appropriate. In this book we are mainly concerned with classical 
systems, which are necessarily much larger than an atom, so the continuum ap¬ 
proximation is more often relevant here. The next three examples illustrate the 
use of (3.15) to calculate the electric field. 


EXAMPLE 2 What is the electric field on the midplane of a uniformly charged 
thin wire of length 2 11 The charge per unit length is A. 

Figure 3.3 shows the wire extending from (0, 0, —£) to (0, 0, +£) along the z 
axis. Because of axial symmetry it is sufficient to find the electric field on the x 
axis. As shown, the charge element A dz' produces field dE at the point (x. 0, 0). 
Because the distribution is symmetric about z = 0, the resultant field due to the 
entire wire will be in the x direction. From (3.15) the x component of E is 


1 r* x\dz! X£ 

E x (x, 0,0) = -- / - Tn = - , 

4tt J-e (,r 2 + z' 2 ) Ineox^/x 2 + i 2 


(3.17) 


We might derive this result in another, more geometrical way. The contribution 
dE x due to dq' = \dz! is i • dE = dE cos 6, where cos 6 = x/^/x 2 + z! 2 . Then 
E x is / cos 0dE, leading again to (3.17). 

Note that the field in the midplane is radial. Generalizing to any point on the 
xy plane, at distance r from the z axis, 
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FIGURE 3.3 Example 2. A charged wire. (0, 0, z') is a source point and ( x , 0, 0) is a 
field point. 


E(r) = 


AEr 

27re 0 rvV 2 + E 2 ’ 


(3.18) 


where r is the radial unit vector in the xy plane. 

It is interesting to consider the limiting behavior of this field for far points and 
near points. Far from the line charge, i.e., r E, the field is E r = AE/(2;r€or 2 ). 
As we’d expect, this is the same as the field of a point charge q = 2EA at the 
origin; from far away the line charge looks, to a first approximation, like a point. 
Near the wire, i.e., r E, the field is E r = X/(2jreor). 

If the line charge is infinitely long, then the field on the x axis is obtained from 
(3.17) by extending the integral from —oo to +oo. Evaluating that integral gives, 
for any point in the midplane, 


E(r) = (3.19) 

2;reo r 

This is the same result that we found for a finite line charge for points near the 
line. From close enough the finite line looks, to a first approximation, infinitely 
long. 


EXAMPLE 3 What is the electric field on the axis of a circular loop of uniformly 
charged thin wire with total charge ql Let a be the radius of the wire circle. 

In Figure 3.4 the wire is in the xy plane centered at O. The charge element dq 
produces field dE at the point (0,0, z), as shown. Because the charge distribution 
is axially symmetric, the resultant field of the entire wire is in the z direction, 
and dE z = dE cos £ = dE(z/Va 2 + z 2 ). Each charge element makes the same 
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z 



FIGURE 3.4 Example 3. A circular loop of wire. dE is the field of the elemental 
charge dq. 


contribution to dE z because they are all at the same distance Va 2 + z 2 from the 
field point. Therefore the integrand in (3.15) is a constant, and we have 


E z ( 0, 0, z) = 


qz 


4jreo ( a 2 + z 2 ) 


2 ) 3 / 2 ’ 


(3.20) 


A charged disk, or a charged plane, can be built up from elemental annuli, so 
(3.20) can be integrated over the loop radius to find the field of a disk or plane. 4 5 


EXAMPLE 4 What is the electric field E(x) due to a uniformly charged spherical 
shell of radius R with total charge <2? The shell thickness is negligible, and the 
surface charge density is a = Q/(4nR 2 ). 

Figure 3.5 shows the shell with its center at O. Because of the symmetry the 
field will be spherically symmetric, E(x) = E r (r) r, where r is the unit radial 
vector. It is sufficient to find the field at P on the z axis, where r = z and r = k. 

Let the shell be built up from elemental annuli, as shown in Fig. 3.5. On the 
z axis we know the field of an annular ring from Example 3. Consider now the 
annular ring at polar angle 0. 5 Its radius is R sinO. Its width is RdO, so it carries 
charge dQ — a 2nR 2 sin Odd. The perpendicular distance from this annular ring 
to P is r — R cos 0. Thus, using (3.20) for the annuli and integrating over the 
entire sphere from 6 — 0 to n. 


E r (r) = 


oR l 


(r — R cos 0) sin Odd 


2 f o Jo [/? 2 sin 2 6 + (r — R cos/)) 2 ] 


3/2 ' 


(3-21) 


4 See Exercise 6. 

5 We often use the convention that source-point coordinates are primed. Here, however, because E r 
does not depend on angular coordinates, there is no ambiguity in letting 9 (without a prime) be the 
angular coordinate of an elemental ring in the source. Because of tricky notational details like this, it 
is important to make a sketch with the variables labeled for any field calculation. 
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z 

P 


X 


FIGURE 3.5 Example 4. A spherical shell. The elemental ring has radius R sin 9 and 
area dA — 2n R 2 sin BdB. 



Changing the variable of integration to u = cos 9 gives an integral that can be 
evaluated from a table of integrals, or from an analytic computer program; the 
result is 


oR 2 T 1 (r - Ru)du 
2e 0 i-l (r 2 + r 2 -2rRu) 3/2 


oR 2 
2 t 0 r 2 


r - R 
V(r - R) 2 



. (3.22) 


Equation (3.22) is valid both inside and outside the sphere if the positive value is 
taken for the radical -J(r — R) 2 . That is, outside the sphere (r > R) the radical is 
r — R, and inside the sphere (r < R) the radical is R — r. Therefore the electric 
field is 

E(x) = f for r > R ’ (3.23) 

I 0 for r < R. 


The final result is remarkably simple. The field outside the spherical shell is the 
same as if all the charge were concentrated at its center. And the field inside the 
spherical shell is 0, because the vector sum of the Coulomb fields from all the 
surface elements is 0. 


It is not too hard to extend the conclusions of the last paragraph to any finite 
spherically symmetric charge distribution, i.e., for any charge density p(r') that 
depends only on radial distance. Any such charge dislribution can be considered 
to be made up of elemental spherical shells. The field outside the distribution 
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is the same as if all the charge were at the center, because that is true of every 
elemental shell. Also, the field at a point inside the charged region, say at distance 
r from the center, depends only on the charge p(r') at points r' < r, because the 
elemental shells with r' > r have zero field at r. 

An interesting historical note is that the gravitational field problem analogous 
to the problem we have just discussed, namely to find the field outside and in¬ 
side a spherical distribution of mass, stymied Isaac Newton as he was developing 
integral calculus. Fortunately for the development of science, he did eventually 
solve it. 

The field inside a spherical shell of charge is 0, i.e., the vector sum of the 
Coulomb fields of the surface elements is 0, because the Coulomb field is in¬ 
versely proportional to the square of the distance, d’E oc r/r 2 . If the field obeyed 
any other power law, then E would not be 0 inside the sphere. (It would be 0 at the 
exact center of the sphere by symmetry, but not throughout the interior.) This spe¬ 
cial property of the inverse square law provides the most accurate experimental 
test of Coulomb’s Law, which will be discussed in Chapter 4. 

We will return to Examples 2, 3 and 4 later, to illustrate other methods for 
calculating the electric field. The direct evaluation of the field, by integrating over 
the charge distribution, is often not the easiest way to determine the field. In fact, 
in many examples it is the hardest way to do the calculation! 


3.3 ■ CURL AND DIVERGENCE OF E 

From (3.15) we can derive two differential relations, that must be satisfied by any 
static electric field, for the curl and divergence of E(x). We will show that 


V x E = 0, 


(3.24) 


and 


V • E = p/e o. 


(3.25) 


Equation (3.25) is called Gauss’s Law, in the form of a differential equation, and 
it is Irue for any electric field including time-dependent fields. Equation (3.24) is 
true for the field from any distribution of static charge, such as all those treated in 
this chapter. Later, when we consider electromagnetic systems that vary in time, 
we will replace (3.24) by the equation for Faraday’s Law. 

To prove these relations, we first derive an alternate form of (3.15). Note that 


x ~ x ' = y 1 
|x —x'| 3 |x — x'l 


(3.26) 


To verify this equation is a straightforward exercise in vector calculus: The gradi¬ 
ent on the right-hand side is, by the chain rule of differentiation, 
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-1 


X —X' X —X' 


Vlx-x'l; 


(3.27) 


and for any Cartesian component i. 


V/|x - x'l = — y (*t - x[) 2 + (x 2 - x' 2 ) 2 + (X3 - X3) 2 


2|x — x' 


- 2 (X{ - X-). 


(3.28) 

(3.29) 


Combining these results yields (3.26). So, replace r/r 3 in (3.15) by —V(l/r), 
where r = x — x'. The gradient with respect to x can be pulled out of the integral 
over x', so 


E(x) = -V 


1 

47re 0 


f p{x!)d 2 x' 

J |x -x'l ■ 


(3.30) 


In this alternate form of (3.15), E(x) is written as the gradient of a scalar function. 
That V x E is 0 follows immediately, because we proved in Chapter 2 that the 
curl of the gradient of any scalar function is identically 0. Hence (3.24) is proven. 
In electrostatics the field is irrotational. 

We have written E(x) in terms of a scalar function V (x) as 


E(x) = -VV(x) 


where, as we see from (3.30), 


V(x) = 


1 r p(x')d 2 x' 

4rre 0 J |x-x'| 


(3.31) 


(3.32) 


The very important function V(x) is called the electric potential. We will have 
much more to say about V(x) in Secs. 3.5 and 3.6. According to (3.25) V(x) 
satisfies the equation 


-V 2 V = p/e 0 , 


(3.33) 


which is called Poisson’s equation. 

For (3.32) the charge is distributed in a 3-dimensional volume. If the charge 
is distributed over a 2-dimensional surface, or along a 1-dimensional curve, then 
pd 2 x' would be replaced by ad A 1 or Xdt', respectively. If the charge distribution 
consists of isolated point charges, then the potential is 


V(x) = 


1 

4rre 0 


N 


E 


Qk 

ix-x*r 


6 Here 1 = 1, 2, 3 correspond to x. y , z ; that is, * 1 = x , X 2 = y , *3 = 1 . 


(3.34) 
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The potential function may be used to solve problems where the charge distribu¬ 
tion is known, in the following way: Calculate V (x) from (3.32) or (3.34), or by 
solving Poisson’s equation; then obtain the field as E = — VV. This approach is 
often easier to accomplish than evaluating the vector integral in (3.15). 

The proof of (3.25) is addressed in Secs. 3.4 and 3.5. 

3.3.1 ■ Field Theory Versus Action at a Distance 

Action at a distance assumes that two charges, separated by any distance, exert 
forces on each other directly. Field theory is a different concept. Charges create 
the field. The field fills the volume, determined by the local relations (3.25) and 
(3.24) at every point in the volume. The force on a charge is exerted by the field 
at the position of the charge. All the action is local. 

3.3.2 ■ Boundary Conditions of the Electrostatic Field 

In later chapters we will study systems in which there are interfaces between 
different materials, e.g., between insulator and conductor, or between different 
dielectrics. It will be important to know how the field changes at such a surface of 
discontinuity. We can determine the boundary conditions from general consider¬ 
ations, based on the relations (3.24) and (3.25). 

Let S be an arbitrary surface in space. For example, one might think of the 
interface between different materials, or a surface separating two parts of an elec¬ 
trostatic system. Because V x E = 0, the tangential components E, of the electric 
field must be continuous across S. To prove this statement, consider an infinites¬ 
imal rectangular loop cutting through the surface, as in Fig. 3.6(a). By Stokes’s 
theorem the line integral of E around the loop is 0, because V x E = 0. In the 
limit that the two segments on opposite sides of S approach S, the line integral 
approaches 

E ,2 • dl 2 + E/i ■ di\ = (E ,2 — E,]) ■ ds 



FIGURE 3.6 Proof of boundary conditions, (a) An infinitesimal loop cutting through 
a surface S. Region 1 is below the surface and region 2 is above, ds is a small displacement 
on the surface, (b) An infinitesimal pill box cutting through S. n is the unit vector normal 
to the surface, pointing from region 1 to region 2. 
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where E/i and E ,2 are tangential components on opposite sides of S and ds is the 
segment on the surface. There is no contribution from the segments perpendicular 
to S because their lengths shrink to 0 in the limit (see Fig. 3.6(a)). Thus, because 
the line integral is 0, E,i must be equal to E ( 2 . That is, E f is continuous across S. 

Gauss’s Law implies that the normal component E n of the electric field is con¬ 
tinuous across S unless there is a surface charge, in which case the discontinuity 
of E n is a/eo- To prove this statement, consider an infinitesimal pill box cutting 
through the surface, as in Fig. 3.6(b). By Gauss’s divergence theorem the flux of 
E out of the pill box is (/enclosed / e o, because V • E = p/eo- In the limit that the 
two faces on opposite sides of S approach S, the charge enclosed approaches od A 
where dA is the areaof those faces. The flux approaches E n idA — E n \dA, where 
E n \ and E n 2 are the normal components on opposite sides of S. (The cylindrical 
area does not contribute to the flux in the limit that the height shrinks to 0.) Thus 
the discontinuity of E n is E n 2 — E n 1 = a/eo- 

We shall encounter many examples of these boundary conditions. 


3.4 ■ THE INTEGRAL FORM OF GAUSS'S LAW 

3.4.1 ■ Flux and Charge 

We have stated the differential equation (3.25) that relates divergence and density 
at every point throughout the volume. That equation is called the differential form 
of Gauss’s Law. 

The integral form of Gauss’s Law is a relation between the flux of E through 
any closed surface S, and the total charge Q enclosed by S 7 We may derive the 
integral relation from the differential relation, by applying Gauss’s divergence 
theorem. Consider the integral of V • E in an arbitrary volume V. By (3.25) this 
integral is Q/e 0. But by Gauss’s theorem the integral of V • E in V equals the flux 
of E through the surface S of V. That is, 

&E-dA = — f p(x)d i x = Q^^-, (3.35) 

Js fo Jv fo 

which is the integral form of Gauss’s Law. Equations (3.25) and (3.35) are math¬ 
ematically equivalent statements of the same physical principle—that charge 
causes divergence of the electric field. 

3.4.2 ■ Proof of Gauss's Law 

Gauss’s Law (3.35) is often used in physics to calculate electric fields, so it is 
important to prove that it comes from experiment—from Coulomb’s law. The 
proof is an interesting exercise in analytic geometry. To get the basic idea, let’s 
first prove it for the very special case of a point charge q and a sphere S of radius 

7 The integral form of Gauss’s Law is used in elementary textbooks of physics, and the reader may 
find it helpful to review the discussion in such a book. 
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R around q. The electric field at a point x on the sphere is 


E(x) = 


47TCo^ 2 


(3.36) 


The area element d A at x is dA = rR 2 dQ, where dQ is the solid angle subtended 
by d A. Thus the flux of E out through S is 

<bE-dA= -2— J)dn= —, (3.37) 

Is 47re 0 / e 0 

which agrees with Gauss’s Law (3.35) for this special case. 8 Note that the flux 
does not depend on the radius R of the sphere, because as R increases the area 
increases as R 2 while the field decreases as 1 /R 2 . 

Gauss’s Law is true for any charge distribution and any surface, so we must 
generalize the above proof. So now, let S be an arbitrary closed surface, and.again 
compute the flux out through S of the field due to a point charge q enclosed by S. 
The area element at a point x on S is 


dA = 


r~dQ „ 

-n. 

| cos#| 


(3.38) 


where n is the unit outward normal at x, dQ is the solid angle subtended by 
d A, and 6 is the angle between n and ?. (The unit vector ? points radially away 
from q.) Figure 3.7 shows the geometry. A projection factor 1/| cos 6\ is in (3.38) 
because dA, which is normal to S, is not generally parallel to the radial direction. 
Imagine a beam of light shining on a surface at an oblique angle; the illuminated 
area on the surface would be larger than the cross section of the beam by the same 
projection factor 1 /1 cos 6 \. 



FIGURE 3.7 Proof of Gauss’s Law. The solid angle subtended by the area element d A 
is d£2 — r ■ dA/r 2 . The electric field E due to the point charge q is in the direction of r. 

8 The differential solid angle in spherical coordinates is dQ = sin 8ddd<p. The integrated solid angle, 
over all angles 9 and 0, is 47r. 
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The flux through d A is 


E-dA = 


q r 2 dSl 

-r COS 8 - -— 

4^-eor 2 | cos 01 


qd£l 

4tre 0 


(3.39) 


where r is the distance from q to d A. The sign depends on the relative directions 
of ? and n, and is positive if q is enclosed by S. Finally, the flux through the 
whole surface is proportional to the integral over all solid angles, <f dQ = 4n. 
Hence • dA = q/e o- This proves that Gauss’s Law follows from Coulomb’s 
law. 

We have assumed that q is enclosed by S. On the other hand, if q is outside 
S, then the flux through d A is negative over part of the surface S (assuming q is 
positive), because d A is the outward normal. On the part of the surface closer to 
q, the radial vector has a negative projection on n, so there the flux is negative. On 
the part of the surface farther from q, the radial vector has a positive projection 
on n, so there the flux is positive. In this case the solid angle integral is 0 because 
the positive and negative sections of S subtend the same solid angle from q, so 
the net flux is 0. This result is also consistent with Gauss’s Law because q is not 
an enclosed charge. 

If there are many charges present, or a charge continuum, then Gauss’s Law 
is still true by the superposition principle. The electric field is the superposition 
of fields due to individual charges, or charge elements, and Gauss’s Law (3.35) 
holds for each charge in the system so it holds for the sum of charges. 


3.4.3 ■ Calculations Based on Gauss's Law 

For charge distributions with a high degree of symmetry, it is often possible to de¬ 
termine the field E(x) by direct application of (3.35), taking into account the sym¬ 
metry. We will consider three examples of this technique, with spherical, cylin¬ 
drical, and planar symmetry. 9 


EXAMPLE 5 What is the electric field due to a uniformly charged sphere? 

Let p be the charge density for r < a, where a is the radius of the sphere; 
uniformly charged means that p is constant, so the total charge is Q = |tra 3 p. 
By the spherical symmetry of the problem, E(x) must be in the radial direction 
(“diverging from the charge”) and the magnitude |E(x)| can only depend on the 
distance r = |x| from the center. That is, in spherical coordinates E(x) must have 
the form 


E(x) = E(r) r. (3.40) 

We want to find E(x) both inside and outside the sphere of charge. To fi nd the ex¬ 
ternal field apply (3.35) to the spherical Gaussian surface of radius r > a shown 
dashed in Fig. 3.8(a). The flux is <t>£ = 4 nr 2 E(r). For this case the charge en- 

9 When studying these examples, please sketch apictureof the charge distribution, field, and Gaussian 
surface for each calculation. 
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FIGURE 3.8 Gaussian surfaces for three symmetries, (a) Spherical symmetry; 
(b) cylindrical symmetry; (c) planar symmetry. The Gaussian surfaces are dashed. 


closed is Q, so 


E(r) = 


Q 

47ieor 2 


for r > a. 


(3.41) 


This is the same as the field for a point charge Q at the origin. The same result 
was obtained in Example 4 by integrating the Coulomb field. Generalizing this 
analysis, we see again that for any spherically symmetric charge distribution of 
finite radius, the electric field outside the charge is the same as if all the charge 
were concentrated at the center. 

To find the internal field, apply (3.35) to the spherical Gaussian surface with 
r < a shown dashed in Fig. 3.8(a). The flux is again 4nr 2 E(r), but in this case 
the charge enclosed is |?r r 3 p = Qr 2 /a 3 ; so 


E(r) = 


Qr 

47reoa 3 


for r < a. 


(3.42) 


Inside the sphere the field decreases as r decreases, and is 0 at the center. The 
reason is that the field at r depends only on the charge inside r. In other words, 
the contribution to the field at radius r from the charge density outside radius r 
is 0. 

In Example 4 we found that the field inside a uniformly charged spherical shell 
is 0, by integrating the Coulomb field. Gauss’s Law provides a simpler proof of 
this result. For a spherical shell the field must be radial, by symmetry. But there 
is no charge enclosed by a Gaussian surface drawn inside the shell. Therefore the 
field inside must be 0. 

We have obtained from Gauss’s Law the same results that we obtained ear¬ 
lier by integrating the Coulomb field. But it is important to appreciate that the 
calculations by Gauss’s law are much easier than evaluating the integral. 


EXAMPLE 6 What is the electric field of a uniformly charged line? The line is a 
mathematical idealization, both because the length is infinite and the diameter is 
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infinitesimal. But the result will be a good approximation for the field of a long 
charged wire for points near the wire far from the ends, i.e., for r « t where 
L is the length of the wire. Approximating the finite wire as an infinite line is an 
example of what is called “neglecting the end effects.” 

Let X be the charge per unit length. By the cylindrical symmetry of the prob¬ 
lem, and the fact that E diverges from charge sources, E(x) must have the form 

E(x) = E(r )?, (3.43) 


where r is the perpendicular distance to the line, and ? the radial unit vector 
of cylindrical coordinates. Now apply (3.35) to a cylindrical Gaussian sur¬ 
face of radius R and height h, shown in Fig. 3.8(b). (This Gaussian surface 
respects the cylindrical symmetry.) The flux through the cylinder surface is 
= InRhE (/?), and the charge enclosed is Xh, so 


E(R) = 


X 

2tt€qR 


(3.44) 


This result agrees with (3.19), obtained in Example 2 by integrating the Coulomb 
field, but the calculation from Gauss’s Law is much easier. 


EXAMPLE 7 What is the electric field of a uniformly charged plane? The infinite 
plane is another mathematical idealization, but the result is a good approximation 
for the field of a charged plate far from the edges. 

Let a be the charge per unit area, and take the z axis to be perpendicular to the 
plane. By the planar symmetry, and the fact that E diverges from charge sources, 
E(x) must have the form 


£(z)k for z > 0 

—£(|z|)k for z < 0, 


(3.45) 


pointing away from the plane in both regions. Now apply (3.35) to a cylindrical 
Gaussian surface whose end faces are at ±z and have area A, shown in Fig. 3.8(c). 
(This Gaussian surface respects the planar symmetry.) There is flux of E through 
the end faces, equal to <t>£ = 2 AE(z), and the charge enclosed is a A, so 

E{z) = . (3.46) 

2e o 

Note that the electric field vector is discontinuous at the charged surface, being 
+ko-/(2eo) above and —kcr/(2eo) below, with discontinuity kcr/eo. The discon¬ 
tinuity agrees with the boundary condition derived in Sec. 3.3. The field is inde¬ 
pendent of the height z above the plane because we have considered the field of 
an infinite plane. For a finite plate, far from the edges, the field is approximately 
constant for z much less than the size L of the plate; but for z > L the field 
decreases with z- 
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Gauss’s Law is always Irue, even for time-dependent problems. However, it 
is useful for calculating E only if the symmetry is high, as in these examples. 
Although limited, the Gaussian-surface trick is powerful. For the spherical case, 
as in Example 5, Gauss’s Law will give E for any spherically symmetric den¬ 
sity p(r). For the cylindrical case, Gauss’s Law will give E if the charge density 
depends only on the cylindrical radial coordinate r. If p varies with z, e.g., for 
a charge density of finite extent in z, the method cannot be used. In the planar 
case, Gauss’s Law will give E only if the surface charge density a is constant. Of 
course any solutions can be superposed; for example, one can find the field for 
two planes with different charge densities using the result of Example 7. Also, 
the solution for a highly symmetric system may be a useful approximation to a 
nonsymmetric case; for example, finite parallel plates may be approximated by 
infinite parallel planes if edge effects are neglected. 


3.5 ■ GREEN'S FUNCTION AND THE DIRAC DELTA FUNCTION 


The previous section shows how the integral form of Gauss’s Law can be used to 
analyze an electric field. However, in many ways the differential form (3.25) is 
even more useful. Or, writing E = — W, where V (x) is the electric potential, the 
function V (x) satisfies Poisson’s equation (3.33) which may be solved by various 
methods. 

To develop an appreciation for the differential form of Gauss’s Law, it is useful 
to study an important modem viewpoint, based on the Green’s function method. 
We need the fact that 1 /4zr |x — x'| is the Green’s function of the differential 
operator —V 2 . In this method, a point source is described by setting the density 
function equal to a Dirac delta function. A brief mathematical excursion is in 
order. 


3.5.1 ■ The Dirac delta Function 


The Dirac delta function S (x ) is a generalized function with the following defining 
property:/or every continuous function f(x). 


8(x)f(x)dx = f( 0). 


(3.47) 


This looks innocent enough, but in fact 8 ( x) is an extremely singular function. To 
satisfy the definition, 8(x) must be 0 for all values of x not equal to 0. (Proof: 
Consider the class of functions g(x) such that g(x) > 0 for all x, and g(0) = 0. 
Then f 8(x)g(x)dx = 0. The only way this can be true for every g(x) in the class 
is if 8(x) = 0 for all x f 0.) However, the integral of 8{x) is 1, 


<5(.r) dx = 1. 


(3.48) 


(Proof: Let f(x) be the constant function 1 in (3.47).) Thus the graph of 8{x) is 
an infinitely sharp spike at x = 0. 
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FIGURE 3.9 The Dirac delta function is the limit of a sequence of sharply peaked 
functions. In the limit, the width —> 0 and the integral —> 1. 


To justify rigorously the existence of such a mathematical object requires a 
careful analysis in advanced mathematics. In fact, mathematicians call &{x) a 
“generalized function” or “Schwartz distribution,” to distinguish it from ordinary 
functions. But we will adopt a more intuitive attitude toward the delta function. 
It is perhaps most easily understood as the limit of a sequence of more and more 
sharply peaked functions, as illustrated in Fig. 3.9. The delta function was in¬ 
vented by a physicist (Paul Dirac) and is widely used in many branches of math¬ 
ematical physics. 

The 3-dimensional delta function <5 3 (x) is defined analogously: For any con¬ 
tinuous function /(x), 


j S\x)f(x)d 3 x = /(0). (3.49) 

Since d 3 x is just dx dy dz, the 3D delta function is the product of ID delta func¬ 
tions for each Cartesian coordinate 

s 3 (x) = au)a(y)au). (3.50) 

Thus <5 3 (x) is 0 for x ^ 0; its integral is 1, /6 3 (x)d 3 .r = 1. If x is a spatial 
coordinate, then dx has units of length, and <5 (x) has units of (length) -1 because 
f S(x)dx = 1. The units of <5 3 (x) are (length) -3 . 

The delta function usually appears in an integral. We evaluate integrals involv¬ 
ing the delta function by changing the variable of integration (if necessary) to 
convert the integral to the form (3.47), or in three dimensions (3.49). For exam¬ 
ple, consider the integral of 8(x — y)f(y) with respect to y; changing the variable 
of integration from y to £ = x — y we find 

/ OO /*oo 

Hx-y)f(y)dy= S^)f(x-^ = f(x) (3.51) 
-OO J —oo 

where the second equality follows from (3.47). Similarly, for three dimensions, 
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I S 3 (x-y)f(y)d 3 y = /(x), (3.52) 

which is an identity that we will use later. The meaning of (3.52) should be clear: 
<5 3 (x) is a sharp spike at x = 0, so <5 3 (x — y) is a spike at y = x; integrating over 
y picks out the value of /(y) at y = x. 


The Green's Function of—V 2 

Green’s function techniques, which have wide usage in physics, provide a way to 
write the solution of a partial differential equation in closed form, as an integral 
over distributed sources. In electrostatics the partial differential equation is Pois¬ 
son’s equation (3.33), and the source is the charge density. The physical reasoning 
underlying the mathematical technique is that any distributed source can be con¬ 
sidered as a sum or integral over elemental sources. Therefore, one first finds the 
potential function of a point source of unit strength located at an arbitrary point 
x'; that potential is the Green’s function. Then the potential function for the whole 
charge distribution is the integral of the Green’s function over the source density. 

The Green’s function of —V 2 is the function G(x — x') defined by the equation 

—V 2 G(x - x') = <$ 3 (x - x'), (3.53) 


along with the boundary condition that G approaches 0 at infinity. 10 We shall 
prove the following theorem: 


G(x-x') 


1 

47t|x - x'| ’ 


(3.54) 


Proof: First, it is sufficient to let x! be at the origin, because all the relevant func¬ 
tions depend only on x — x', the relative position of x and x'. So, what we need to 
prove is that 


-V 2 ^- =4tt5 3 (x). 


(3.55) 


For all x with |x| f 0 the left-hand side of (3.55) is 0: Using spherical coordinates, 

*G)-£GW=!(;)-" 


for r ^ 0. (This may also be proven in Cartesian coordinates.) For the point 
|x| = 0, V 2 (l/r) is not well-defined in the usual sense, because 1/r is singular at 
r = 0. To discover what happens at the origin, consider the integral of V 2 (l/r) 
in a spherical volume V of any radius around the origin; using Gauss’s theorem, 



(3.57) 


*°We shall see in Sec. 3.6 that G(x — x')/ f 0 is the electrostatic potential of a unit charge at x', i.e„ 
charge density S 2 (x — x'). 
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where S is the spherical boundary of V. But r- V(l/r) is — 1 /r 2 , and dA = r 2 d$l\ 
therefore the right-hand side of (3.57) is — f s dQ = —4tr. So we have shown that 
V 2 (l/r) is 0 for r / 0, but its integral over the volume of any sphere is —4ir. 
These are precisely the characteristics of the delta function, summarized in (3.49); 
hence equation (3.55) is proven. 

An alternative proof of (3.55), somewhat more comfortable for people who 
avoid oo in calculations, can be provided. This analysis does not use the singular 
function 1/r directly, but regards 1/r as the limit ofl/Vr 2 + e 2 ase -* 0. 
Denote by g(r, e) the latter function, which is nowhere infinite as long as c ^ 0. 
Now, it is straightforward to calculate — V 2 g, 

2 3e 2 

V (r 2 + 62)5/2- 

This function becomes more and more sharply peaked at r = 0 as e approaches 0. 
Note that — V 2 g tends to 0 if r ^ 0, and to oo if r = 0, as e —*■ 0. The total 
volume integral of — V 2 g is 4 n, independent of e; the integral can be evaluated 
either using Gauss’s theorem, or more directly by integrating over spherical polar 
coordinates. Thus — V 2 g approaches 4 tt<5 3 (x) as c —* 0, proving (3.55) since 
g(r, 0) = 1/r. 

The reader may feel uneasy about this very singular mathematics. But Green’s 
function methods are common in mathematical physics, and the delta function is 
a modem tool of theoretical physics. In Sec. 3.6 we will use the Green’s function 
(3.54)—as the potential of a unit point source—to solve Poisson’s equation. 


3.5.2 ■ Another Proof of Gauss's Law 


After these mathematical developments, we are prepared to calculate the diver¬ 
gence of E directly from (3.30). Knowing the Dirac delta function, the calculation 
is extremely simple: really just a formality. From (3.30) and (3.54), 


V E = 



—V 2 G(x — x')j p(x)d 3 x'; 


(3.58) 


but the quantity in square brackets is <5 3 (x — x'), so by the identity (3.52), V • E = 
p(x)/e o, which is the divergence equation (3.25), Gauss’s Law of electricity. 


3.6 ■ THE ELECTRIC POTENTIAL 
3.6.1 ■ Definition and Construction 

A general problem of electrostatics, which we are now prepared to consider, is to 
find E(x) everywhere for an arbitrary charge distribution p(x'), if p(x') is known 
a priori. This is an important milepost on our path to learning how to solve the 
most general problem in electrostatics, which includes conductors and dielectrics 
in the system, discussed in Chapters 4 through 6. 
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For all of these cases the solution for E(x) should be unique. This raises a 
mathematical question: What charges, potentials, and boundary conditions must 
be specified in order that the function E(x) that solves the field equations is a 
unique solution? The answer must be consistent with the physical condition of 
uniqueness: Once the charges, batteries, conductors, and dielectrics are in place, 
then nature determines a unique field E, which could be measured with suitable 
apparatus. 

For electrostatics the electric field E(x) is irrotationai, that is, V x E = 0. It 
follows that we may construct the electric potential V (x), a scalar function such 
that 


E(x) = —VV(x). 


(3.59) 


We proved in Chapter 2 that the curl of the gradient of any scalar function is 0, 
so (3.59) guarantees that E is irrotationai. The electric potential was introduced 
into the theory of electrostatics by Green. 11 It is an important mathematical tech¬ 
nique in field theory. The SI unit of electric potential is the volt (V) defined by 
1 V=1 J/C; that is, voltage has the units of work per unit charge. 

To justify (3.59) we must address two issues—the existence and uniqueness 
of such a function V (x). As to existence, we have already seen that E(x) can be 
written as a gradient in (3.30), and indeed from that equation we have written a 
formula for V (x) of a static charge density, 


v (x) = r~ [ 
47T60 J 


p(\')d 3 x' 
Ix-x'l • 


(3.60) 


This proves the existence of V(x), by construction. For example, for a point 
charge q at the origin the electrostatic potential is 

V(x) = -2—. (3.61) 

4^-eo r 

One can easily verify (3.59) for this basic example. 

But it is instructive to construct V(x) in another way. Let — V (x) be the line 
integral of E along a curve T from a reference point xq to x 


V (x) = - J E ■ di. (3.62) 

This equation says that V (x) is equal to the work that must be done by an ex¬ 
ternal force, acting against the electric force, to move unit charge from xo to x 
along the path F, because —E is the force per unit charge that the external agent 

” George Green (1793-1841) owned a grain mill in Nottingham, England, and was a self-taught 
amateur mathematician. In 1828 he published a paper on the mathematical theory of electricity and 
magnetism, introducing the concept of the potential, which was largely ignored. But the paper was 
rediscovered by Lord Kelvin in 1845, and became influential in the development of the theory. The 
Green’s function is named after George Green. 
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must exert. 12 We will prove below that V(x) does not depend on the particular 
choice of the path T from xo to x, because E is irrotational. The function V (x) in 
(3.62) satisfies (3.59). Consider the change of V(x) for an arbitrary infinitesimal 
displacement <5x, 

V(x + <5x) — V(x) = W ■ 8x = — J E • dx + f E • dx = —E(x) • <5x. 

Jr+8x Jr 

The first equality is the definition of W; the second equality follows from (3.62), 
because the path of integration for V (x + <5x) is T from xo to x followed by the 
extra displacement <5x from x to x + <5x. Thus E(x) = — W as required. 

Now, what about uniqueness? The line integral in (3.62) is set up with a refer¬ 
ence point xo and path T. These are not unique, so the question is whether V (x) 
depends on xo or T. In fact, V (x) does not depend on T, but in general it does 
depend on xo- Let Ti and T 2 be two different paths both from xo to x, and Vi and 
V 2 the corresponding functions defined by (3.62). Then 

Vi(x) - V 2 (x) = - [ E ■ d£ + f E-d£ = -(hE-d£, (3.63) 

J i"! Jr 2 Jc 

where C is the closed curve consisting of T i from xo to x, and the reverse of T 2 
from x back to xo- But § E • d£ = 0 by Stokes’s theorem, because V x E = 0. 
Thus V\ = V 2 , proving that V (x) does not depend on the path. 

The path independence leads to an interpretation of the potential difference be¬ 
tween two points. The difference V(X 2 ) — V (xi) is equal to the work that must be 
done by an external agent to move unit charge from xi to X 2 against the electric 
force, along any path. The work is independent of the path because the electro¬ 
static force is conservative. 

In general V (x) does depend on the reference point x<>. If we choose another 
reference point, say xi, then the new potential V\ (x) may be written as the line 
integral from xi to xo, and then from xo to x; the first part is — V(xi), and the 
second part is V (x), so 


V](x) = V(x) - V( Xl ). (3.64) 

Changing the reference point is equivalent to adding the constant C = — V (xi) to 
the original potential. Thus the potential V (x), as a function of x, is determined 
only up to an additive constant. It is obvious that V (x) + C satisfies (3.59) for any 
constant C. The nontrivial point is that there is no other nonuniqueness of V (x) 
besides an additive constant. 

When solving problems, it often helps to make a judicious choice of the ref¬ 
erence point. If the total charge in the system is finite, then the reference point is 
usully chosen to be at infinity. In that case V (x) is the work required to bring unit 
charge from oo to x, and V (x) approaches 0 as x goes to oo. If, however, the sys- 


12 Or, equivalently, V (x) is the work per unit charge done by the electric force acting on a test charge 
that moves from x to xq. 
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tem has infinite charge, as in a charged line or plane, then choosing the reference 
point at oo is not valid; the potential diverges at oo. In such cases, as we’ll see 
in Example 8 following, one may choose any other convenient point, or leave the 
reference point unspecified. 

We showed in Sec. 3.3 that the normal component of the electric field, at a 
surface, is discontinuous if there is a surface charge. The potential, however, is 
continuous, because it is equal to the line integral of E along a path. The variation 
of the integral, as the path crosses the surface, cannot be discontinuous unless 
the charge density is infinite, which would be unphysical. A discontinuity of E 
corresponds to a change in the gradient of V, rather than a discontinuity of V. 

We recall from Chapter 2 that the gradient of a scalar function is everywhere 
perpendicular to any surface on which the function is constant. Applying this 
geometric relationship to V (x) and E(x), the electric field vectors are everywhere 
normal to equipotential surfaces, pointing toward smaller V. We may visualize 
E(x) by drawing the electric field lines, which are the tangent curves of the vector 
field, i.e., the set of curves everywhere tangent to the field vectors. An electric 
field line crossing any equipotential surface is perpendicular to the surface. 

3.6.2 ■ Poisson's Equation 

The electrostatic field is determined throughout space by local relations, V x E = 
0 and V • E = p/eo- Introducing the potential, i.e., writing E = —VV\ guarantees 
VxE = 0, but that equation gives no information on V (x). Gauss’s Law becomes 

-V 2 V = p/e 0 . (3.65) 

This equation, called Poisson’s equation, together with the boundary conditions, 
must determine the potential function. 

Green's Solution 

The Green’s function G(x — x') is defined by (3.53). Comparing the definition 
with Poisson’s equation shows that G (x — x')/eo is the solution of Poisson’s equa¬ 
tion for a point source of strength 1 at x', with the boundary condition G —> 0 
at infinity. (The density function of a point source is proportional to the delta 
function—an infinitely sharp spike at the source point.) In other words, G/eo is 
the potential of a unit point source. 

We may write the general solution to Poisson’s equation in terms of the Green’s 
function, as 


V(x) = — f G(x — x / )p(x , )^I 3 .r , , (3.66) 

eo J 

which is known as Green’s solution to Poisson’s equation for an unbounded space. 
(This equation is just the same as (3.60). For practice, the student should verify 
again, by the methods of Sec. 3.5, that this V (x) satisfies (3.65).) Green’s solution 
is an expression of the superposition principle: G(x —x') dq /eo is the potential for 
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the field created by the point-like charge dq = p(x')d 3 x', and the superposition 
of all elemental contributions is the total potential. 

The Helmholtz Theorem 

By writing E = —W, and using Green’s solution to Poisson’s equation, we have 
solved the field equations V x E = 0 and V E = p/e o. This is a special case 
of a more general mathematical problem—to find a function F(x) with specified 
curl and divergence. (For static E the curl is always 0.) The Helmholtz theorem, 
which is discussed in Appendix B, supplies the solution to this general problem. 

3.6.3 ■ Example Calculations of V(x) 

As we have seen, if an eleclrostatic system is highly symmetric then E may be 
calculated from the flux through an appropriate Gaussian surface. When there is 
not enough symmetry to apply this method, the simplest approach, if the charge 
density is known, is usually to calculate V (x) from Green’s solution, and then E 
is — W. 


EXAMPLE 8 What is the electrostatic potential, and electric field, of a uniformly 
charged straight wire of length 21, for points on the midplane of the wire? 


z 



FIGURE 3.10 Examples 8 and 10: A uniformly charged line segment, with charge per 
unit length A, extends from z = — £ to z = +t on the z axis. In Example 8, V and E are 
found on the midplane (z = 0). In Example 10, V (x) is found for all x The equipotentials 
in the xz plane are ellipses with foci at the ends of the line segment, shown as dashed 
curves. The equipotentials in three dimensions are ellipsoids of revolution. 
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The wire is shown in Fig. 3.10. It lies on the z axis from z = — l to z = +£, 
the midplane is the xy plane, and A denotes the charge per unit length. By (3.60) 


V(r) 


= -L [ 

47re 0 J _ 


f* - X ml 

l + Vr 2 + t 2 

/-< Vr 2 + z /2 47reo 

-t + s/r 1 + l 1 


(3.67) 


where r is the perpendicular distance to the wire, i.e., the radial coordinate of 
cylindrical coordinates. On the midplane the electric field is radial, by symmetry, 
so 


„ „9V Mr 

E(r) = —r— = —- 

2n€orVr 2 + i 2 


(3.68) 


The same result was obtained earlier in Example 2. 

In the limit £ —► oo, the field approaches the result (3.44) of Example 6. In 
this limit the total charge is infinite. The midplane field is well-defined in the 
limit, but the potential (3.67) diverges as l —► oo. For (3.67) the reference point 
(at which V = 0) is at r = oo, which is fine for finite i, but not valid for an 
infinite charged line. For an infinite wire the potential can be written as Vii ne (r) = 
— (A/27reo) ln(r/ro), where the reference point (where Vij ne = 0) is r = r o, an 
arbitrary constant. This function Vii ne (r) diverges as r —► oo. 


EXAMPLE 9 What is the electrostatic potential, and electric field, above a uni¬ 
formly charged circular plate of radius a, on the axis of symmetry? 

Let a be the charge per unit area, and let the z axis be the axis of symmetry, as 
shown in Fig. 3.11. Then by (3.60), using cylindrical coordinates, 


V(0,0, z) 


1 /’ a alnr'dr' 

47reo Jo Jr' 2 + z 1 




(3.69) 


z 



FIGURE 3.11 Example 9: A uniformly charged circular plate. dE is the field of the 
elemental ring. 
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On the positive z axis the electric field is in the direction of k, by symmetry; then 

z 


r dV a k 
E(0, 0, z) = -k— = — 
3 z 2eo 


1 


Va 2 + z 2 J 

In the limit a —» oo this field agrees with the result (3.46) of Example 7. 


(3.70) 


EXAMPLE 10 In the previous two examples we considered only points of sym¬ 
metry of the system. In the current example we consider the full space. The cal¬ 
culation is more difficult but worth the effort. The system is a uniformly charged 
line segment of length 21 with charge per unit length A. 

The potential function. Set u p a coordinate system with the line segment o n the z 
axis from z = — l to z = l ; this is the same configuration as in Examples 2 and 8. 
Figure 3.10 shows the xz plane (y =0). We shall first calculate the potential V in 
this plane, and then generalize the result to three dimensions by axial symmetry. 
At the point ( x , 0, z) the potential may be written 


V(x, 0, z) 


A j‘ + * dz! 

4ttco J-e y/ x 2 + (z — z') 2 


(3.71) 


where A dz' is the element of charge on the line segment at (0, 0, z'). After chang¬ 
ing the variable of integration to f = z. — z! the integral is a standard form that 
can be found in integral tables, and the result is 


V(x, 0, z) 


A | z + l + %/* 2 + (z + tf 2 

4jre 0 \ Z -e + yjX 2 + (Z — l ) 2 


(3.72) 


To extend the result to three dimensions, simply replace x 2 by x 2 + y 2 \ because 
by rotational invariance about the z axis, V at a given z depends only on the 
perpendicular distance from the field point to the z axis. So, for all points 


V(x, y, z) 


A z + i + y/x 2 + y 2 + (z + 1) 2 

4tt€0 z - 1 + yfx 2 y 2 + (z - f) 2 


(3.73) 


The equipotential surfaces. An equipotential surface, defined by the equation 
V(x,y, z) = Vo where Vo is a constant, is the locus of points where the argument 
of the logarithm in (3.73) is constant. Deducing the shape of the surface from this 
equation is not very easy, but we can guess the answer and then verify that our 
guess is correct. If the charge were restricted to a point, then the equipotentials 
would be spheres; such a symmetric system would necessarily have very symmet¬ 
ric equipotentials. Here the charge is spread out uniformly along a line segment, 
but that is still quite a symmetric system, so the equipotentials should again be 
quite symmetric, but now having two special points (the endpoints of the line). A 
natural guess is that the equipotentials are ellipsoids of revolution. An ellipsoid is 
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the surface swept out when an ellipse in the xz plane is rotated about the z axis. 
We shall verify that an equipotential curve in the xz plane is an ellipse whose foci 
are at (0, 0, £) and (0, 0, — t). 

The equation for an ellipse in the xz plane is 



(3.74) 


where c is called the semimajor axis and a the semiminor axis, assuming c > a. 
The ellipse intersects the z axis at z = ±c, and the x axis at x = ±a. Be¬ 
cause the focal points are at ±£ on the z axis, the minor axis may be written as 
a = s/c 2 — £ 2 . [Note that the sum of the distanc es from (x, z) to the two foci is 
constant along the ellipse; on the x axis it is 2Va 2 + £ 2 , and on the z axis 2c.] 
Now we replace x 2 in the argument of the logarithm of (3.72) by a 2 (l — z 2 /c 2 ), 
and simplify the result i n terms of c and £. A little algebra shows that the numer¬ 
ator is (c + £)(l + z/c) and the denominator (c — £)(l + z/c), so for points on 
the ellipse the potential is a constant Vo given by 


V 0 = 



4rr €q 


c + £ 
c-£ 


(3.75) 


Thus the equipotential curves in the xz plane are confocal ellipses, four of which 
are shown in Fig. 3.10. The potential Vo depends on the semimajor axis c, and 
decreases as c increases. 

By rotational invariance about the z axis the equipotential surfaces in three 
dimensions are confocal ellipsoids of revolution. These surfaces are prolate 
spheroids, characterized by the parameter e = £/c, which is the eccentricity. If 
e approaches 0, i.e., far from the charged line segment, the surface becomes a 
sphere. If e approaches 1, i.e., near the charged segment, the surface becomes 
needle-shaped with large curvature at the ends at z = ±c. We’ll make use of this 
example again in Chapter 4, to determine the field of a charged ellipsoid. 


3.7 ■ ENERGY OF THE ELECTRIC FIELD 

The electrostatic potential is closely related to potential energy. What is the poten¬ 
tial energy associated with the electrostatic force on a charge ql The work done 
by the electric field, as the charge moves along a curve T from the reference point 
xo to x, is 

W = J qE d£=-qV(x). (3.76) 

The potential energy U q (x) with respect to the reference point is by definition 
— W; so 


U q (x) = qV (x). 


(3.77) 
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Thus another way to define the electrostatic potential is that V(x) is the potential 
energy per unit charge of the electric force on a test charge. [The potential energy 
of a force F is well-defined provided V x F = 0, which is true for the electrostatic 
force because E(x) is irrotational.] 

We may use this result to determine the total electrostatic energy of a charge 
distribution p(x). First consider just two point charges q\ and <y 2 - The work done 
by an external agent to bring q 2 from infinity to its position X 2 , against the field 
of q\ , is the potential energy of interaction between the charges 


U 12 = #2Vj = 


q\qi 


4;reo|xj -x 2 | 


For a set of charges { q \, q 2 ,... ■ ,qu} at positions [xi, x 2 ,... 
has potential energy U,j, so the total energy is 


(3.78) 


, xjv} each pair (i j ) 



Mi 

4rre 0 |x,- -xj |’ 


(3.79) 


the double sum over i and j excludes terms with i = j, and the factor 1/2 
accounts for the appearance of each pair twice in the double sum. U is the energy 
required to assemble the given charge distribution. For a continuum with charge 
density p(x), all the charge in an infinitesimal volume dr’x is combined, and the 
sum over discrete particles becomes an integral over volume; then the energy 
is 


1 [ P(X)P(X ° dhd*x' 

2 J 4;reo|x - x' 


Equivalently, we may write 


(3.80) 


U = \ J p(x)V(x)d 3 x. (3.81) 

We regard the electric field as a real physical entity, possessing energy. This 
viewpoint becomes more important later, when we study light as an electromag¬ 
netic wave: The energy transported through vacuum by light must obviously be¬ 
long to the fields. But even in electrostatics we adopt the view that the energy U 
resides in the field. Therefore, we will now rewrite the equation for U entirely 
in terms of the electric field E(x) produced by the charge density. The function 
p(x) V(x) is, by Poisson’s equation, 

pV = -eo(V 2 V) V = -e 0 V • (VVV) + e 0 (W) 2 , (3.82) 

the second equality being an identity of vector calculus. The total volume integral 
of V • (VW) is 0, because by Gauss’s theorem it is equal to the flux of VW 
through the surface at infinity, where V = 0. The final term in (3.82) is eoE 2 . 
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Thus the total energy is 


U -jJ^ 


(3.83) 


The energy density (energy per unit volume) of the electric field is evidently 


ueOO = y£ 2 (x). 


(3.84) 


The idea that energy belongs to the field is intriguing. For example, electro¬ 
static energy stored in fields in the atmosphere, is, on a large scale, the origin of 
lightning; or, on a small scale, the origin of the spark between your hand and a 
metal doorknob. 


EXAMPLE 11 The potential energy of two point charges q\ and qj separated by 
d is q\q2/(4ne()d). How is this potential energy related to the total field energy 
(3.83)? 

Instead of point charges, consider two charged spheres whose centers are sep¬ 
arated by distance d. By the superposition principle, the electric field is Ei + E 2 , 
so the field energy is 

U = y J (e] + E\ + 2Ei • E 2 ) d 3 x = Ui + U 2 + Un • (3.85) 

U 1 and U 2 are self-energies. U\ is the energy of q\ alone, and t/ 2 that of t/ 2 . These 
are independent of the separation d. U \2 is the interaction energy. 

The self-energy U \ is Cq 3 /(4 tt€()R\), where R\ is the radius of the first sphere, 
and C is a dimensionless constant that depends on how the charge is distributed in 
the sphere. 13 f/ 2 is analogous. Note that U\ —>■ 00 as R 1 —> 0; the self-energy of a 
point charge is infinite. This infinity is not relevant to the classical theory, which 
describes macroscopic systems, but it is an issue in quantum electrodynamics 
because the electron is a point particle. 14 

The interaction energy U \2 remains finite in the limit of point charges. It can 
be shown 15 that for point charges, (3.85) gives U i 2 = q\q2/(4neod). Of course 
we obtained the same result earlier (3.78), but here it is derived from the field 
energy. The total field energy U (including self-energies) must be positive, be¬ 
cause M£(x) > 0. But the interaction energy may be negative, implying attraction 
between the charges, if they have opposite signs. 


13 See Exercises 26 and 28. 

14 The procedures of renormalization, developed by Feynman and Schwinger, render the predictions 
of QED finite. 

15 See Exercise 27. 
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3.8 ■ THE MULTIPOLE EXPANSION 

The potential and field due to a point charge at the origin are 

V(x) = and E(x) = (3.86) 

4^eo r 4^eo 

Here r = |x| and ? = x/r. Now consider a collection of charges in a region near 
the origin. The problem we shall solve in this section is to determine the asymp¬ 
totic form of V (x), i.e., far from the charge distribution. The complete treatment 
of this problem is to express V (x) as an expansion in powers of 1/r, called the 
multipole expansion. We won’t derive the full multipole expansion, but rather 
consider the first few terms, which are the most important. 

3.8.1 ■ Two Charges 

As a first step, consider just two charges, q\ at X] and q 2 at X 2 -The exact potential 
function is the superposition 


V(x) = 


q\ 

47re 0 |x — X] | 


+ 


g2 

4jreo|x - X 2 I ’ 


(3.87) 


Asymptotically, i.e., for r r\ and r J>> r 2 , we may make the expansion (for 
k = 1 or 2) 


1 

|x-x*| 


1 f-x fc 3(f -\k) 2 -rl 

~ r r 2 2r 3 



(3.88) 


an expansion in powers of 1/ r. As r —> 00 , each term is smaller than the one be¬ 
fore. The notation 0(l/r 4 ) stands for terms proportional to powers of 1 /r quartic 
or higher. To derive the expansion (3.88) write 


■x*l 


^r 2 -2rr k co&e + rl 


(3.89) 


where e = (—2r/>cos0 + rjt) /r 2 and 6 is the angle between x and x^. Expand 
the result in e using the Taylor series 




(3.90) 


and reexpress the result in powers of 1/r, dropping terms of order r^/r 4 . 

We obtain the multipole expansion for V(x) by substituting (3.88) into (3.87). 
Neglecting terms higher than the quadrupole, the result is 

1 \Q ?_p r • Q 2 • r ) 

4^eo i r r 2 r 3 J 


V(x) = 


(3.91) 
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The three terms in (3.91) are called the monopole ( Q ), dipole (p), and quadrupole 
(Q 2 ) terms. Q is a scalar, equal to q\ + qz, the total charge. The monopole term 
is dominant as r —> 00 unless 2 = 0. The parameter p is a vector, given by 

p = 4 - 1 X 1 + < 72 x 2 , (3.92) 

called the electric dipole moment of the system. The dipole term is dominant at 
large r if Q = 0. The parameter Q 2 is a tensor (or dyadic) 16 , given by 

Q 2 = y [ 3 x 1 x 1 - rfl] + | [ 3 x 2 X 2 - r\ i] , (3.93) 

called the quadrupole moment. Here I denotes the unit tensor. As a vector has one 
index, a tensor has two indices; for example, (X|Xi),y = xux\j and (I),y = <5,y. 
The reader should not feel intimidated by the quadrupole. We shall often need the 
dipole potential—the second term in (3.91)—but rarely the higher multipoles. 

Notice the dependence on r for the various multipoles: For a charge (monopole) 
the potential decreases as 1/r and the field as \/r 2 \ for a dipole the potential de¬ 
creases as 1/r 2 and the field as 1/r 3 ; for a quadrupole the potential decreases as 
1/r 3 and the field as 1/r 4 . This pattern generalizes to higher multipoles. 


EXAMPLE 12 To understand (3.91) it helps to work out a prototype case, with q\ 
and qi on the z axis, at z = d\ and z = dz respectively, illustrated in Fig. 3.12. 
By symmetry, V(x) is V (r, 0) (independent of cp). The exact potential is 


V(r, 6) = 


4jreo 


q\ 


+ 


42 


yJr 2 — 2rd\ cos 9 + d 2 yjr 2 — 2rd2 cos 0 + d- 


(3.94) 


z 



FIGURE 3.12 A system of 2 charges, q\ and 52 - The potential at P is Eq. (3.94). 

16 The student may have encountered in mechanics, the inertia tensor of a mass distribution; the 
quadrupole tensor Q 2 is analogous for a charge distribution. 
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V (r, 6>) = 

1 <71+ <72 , (<?l^l + qidi) cos 0 (qid}+q 2 d%) / 2 \ 

4jt 6 0 r r 2 2r 3 V / 

(3-95) 

This result agrees with the general equation (3.91). The total charge is Q = <71 + 
< 72 * and the dipole moment is p = (q\d\ + < 72 <^ 2 )k. The quadrupole moment is 

Q 2 = \{q x d\ + ^ 2 ^ 2 )( 3 kk - I). (3.96) 

In general the dipole and quadrupole moments depend on the choice of ori¬ 
gin. For example, (3.95) shows that the multipole expansion for a single charge 
displaced from the origin, e.g., <71 at (0, 0, <7i) with q 2 = 0, has nonzero dipole 
and quadrupole moments, p = <?i<7ik and Q 2 = < 7 i< 7 2 ( 3 kk — I)/2; whereas if the 
origin were chosen to be the position of q\ there would be only a monopole term. 
An interesting special case occurs if the total charge in the distribution is 0; then 
p does not depend on the choice of origin, as we shall prove generally later. 


3.8.2 ■ The Electric Dipole 

For two equal but opposite charges, i.e., q\ = q and q 2 = —< 7 , the total charge is 
Q = 0, and the dipole moment is 


p = <?d 


(3.97) 


where d = xj — x 2 is the vector from the negative charge (q 2 — —q) to the 
positive charge (q\ = q). This system is called an electric dipole. An important 
limiting case is the limit d —► 0 with p fixed, called a point-like electric dipole. In 
this limit, and taking the origin to be the position of the dipole, all other multipole 
moments (quadrupole and higher) are 0. The potential of a point-like dipole is 


V(x) 

for all x. If the dipole points in the z 
coordinates the dipole potential is 


p r 


(3.98) 


4 jt €or 2 

direction, i.e., p = pok, then in spherical 


V (r, 0) 


po cos 0 
Ait cor 2 


(3.99) 


The point-dipole potential is a good approximation for a neutral charge distri¬ 
bution that is small compared to all length scales in the problem. For example, a 
molecule, having a size of order 10 -10 m, acts as a point-like dipole when placed 
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3.8.3 




(a) (b) 

FIGURE 3.13 Field lines of (a) a finite dipole, and (b) a point dipole. For both cases the 
dipole moment points in the z direction. 


in a laboratory-scale field. The permanent dipole moments of small molecules 
such as H 2 O, NH 3 , HC1, or CO, are of the order of 1 debye (D), where 1 D = 
3.33 x 10 -30 C m. One debye is approximately the dipole moment of two charges 
±0.2e separated by 1 angstrom. 

The electric field produced by a point-like electric dipole located at the origin 
is 


3? (p • r) - p 

E(x) = -VV= V % . (3.100) 

Equation (3.100) is an important and interesting result, the derivation of which is 
left as an exercise in vector calculus . 17 The field lines for finite and point dipoles 
are plotted in Fig. 3.13. The reader should verify that (3.100) agrees with the 
figure. In particular, verify that E is antiparallel to p on the plane perpendicular to 
p, and parallel to p on the axis of p. 

For the special case of a dipole p = pok located at the origin and pointing in 
the z direction, the field in spherical polar coordinates is 

E(x) = —VV = ^ 3 ^2cos0 r + sin0 6^ , (3.101) 

an expression that will prove useful for analyzing the field directions. 

We will see in Chapter 6 that the atoms and molecules of a dielectric mate¬ 
rial behave as electric dipoles. Because a dipole has an associated electric field 
(3.100), the presence of a dielectric affects the total electric field in its neighbor¬ 
hood. 

Moments of a General Charge Distribution 

For N point charges q\ , < 72 , • • •, qN at positions xt, X 2 ,..., x,v the potential V (x) 
can again be expanded as in (3.91). Generalizing from (3.92) and (3.93) the mo¬ 
ments are 

17 


See Exercise 37. 
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N 

Q = 

k =1 
N 

P=J2^ kXk (3.102) 

k =i 

Qi = X! ? ( 3x * x * _ r * 21 ) • 

*=i z 

For Af > 2 it is possible to have both Q = 0 and p = 0, in which case the 
asymptotic potential is the quadrupole potential, unless the quadrupole moment 
tensor Qi happens to be 0. 

If the charge distribution is a continuum with charge density p(x'), which is 
assumed to have finite spatial extent, then for |x| the size of the region of 
charge we may again use (3.88) for l/|x — x'| in (3.60). In this case the multipole 
moments are 


Q 

P 

Q 2 


J p(x')d 3 x' 

/ x><x ' ) ‘ ,v 

/ \ ( 3x ' x ' -r a \}p(y!)d 3 x'. 


(3.103a) 

(3.103b) 

(3.103c) 


Q is the total charge, p the electric dipole moment, and Q 2 the quadrupole tensor, 
of the charge distribution. For example, a molecule is not simply a pair of opposite 
charges, but rather a collection of nuclei surrounded by a cloud of electrons. The 
dipole moment of a molecule is determined by (3.103b) for the charge density of 
the nuclei and electrons. 

Using (3.103b) we may prove the theorem that the dipole moment is indepen¬ 
dent of the choice of origin if the total charge is 0. If the origin is moved to a point 
xo, then the dipole moment with respect to the new origin is 


Po 


J (x' - XO )p(x')d 3 x' = p - 0 X 0 , 


which is the same as p if Q = 0. 


3.8.4 ■ Equipotentials and Field Lines 

The potential function of a point dipole is (3.98); or in polar coordinates, letting 
k be the direction of p, (3.99). Using the latter form, the equipotential surfaces 
are given by the equation r 2 = C cos 8 where C = po/(47reoV). Figure 3.14 
shows the equipotentials (dashed curves) and electric field lines (solid curves) of 
the point dipole. 



80 


Chapter 3 Basic Principles of Electrostatics 



FIGURE 3.14 Equipotentials (dashed curves) and field lines (solid curves) of a point 
dipole. The dipole points in the z direction. 


The potential function of a point-like linear quadrupole, lying along the z axis, 
is 


V(r,9) = 


Q 2 (3 cos 2 6 — l) 
4;reo 2r 3 


(3.104) 


where Q 2 is the (scalar) magnitude of the quadrupole moment tensor. For exam¬ 
ple, if there are charges +q at z = id and —2 q at z = 0, then Q 2 = 2 qd 2 . The 
equipotential surfaces are given by the equation 

r 3 = C(3cos 2 0 — 1), (3.105) 


where C is a constant depending on V. Figure 3.15 shows the equipotentials 
(dashed curves) and electric field lines (solid curves) of the point quadrupole. 


3.8.5 ■ Torque and Potential Energy for a Dipole in an Electric Field 

An electric dipole p in a field E experiences a torque that twists the dipole toward 
alignment with E. An elementary way to determine the torque is to treat the dipole 
as a pair of opposite charges pulled in opposite directions along E. But we will 
derive the result in a more general way. For a distribution with charge density 
p(x), the torque (about the origin) is 

N = J \xdF = J xx E (x)p(x)d 3 x, (3.106) 

because the force on the volume element d 3 x is pd 3 x E. Assuming that E is con¬ 
stant on the length scale of p(x), i.e., that we are dealing with a point-like dipole, 
xE can be pulled out of the integral, and what remains is the dipole moment 
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FIGURE 3.15 Equipotentials (dashed curves) and field lines (solid curves) of a linear 
quadrupole. 


(3.103b); that is. 


N — p x E. 


(3.107) 


The torque is perpendicular to E, so it twists p toward alignment with E. 

We may define a potential energy U for the dipole p in an external electric 
field. The interaction energy f dq V is 

U = J p(x')V(x')d 3 x', (3.108) 

where p(x') is the charge density of the dipole. Assuming the dipole is pointlike, 
i.e., much smaller than the length scale for variation of E, we may approximate 
V (x') using Taylor’s theorem, 

V(x') % y(x) + (x' -x) • VV(x), (3.109) 

where x is at the center of the dipole. When this is inserted in (3.108) there are 
three terms. Two are 0 because f p(x')d^x' — 0 for a neutral dipole. The remain¬ 
ing term is 


U = J p(x)x'd 3 x' ■ VV(x) 
= -P • E(x). 


(3.110) 


Considering all directions of p, U is minimum when p is parallel to E. Therefore 
p || E is the state of stable equilibrium for the dipole orientation. Note that the 
torque is 0 for this orientation. 
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The function E/(x) in (3.110) is more than the orientational potential energy; 
it is the full potential energy of the point-like dipole in the field E(x), taking the 
potential energy to be 0 at infinity. In other words, U (x) is equal to the work 
required to bring the dipole to x from infinity. One way to prove this claim is to 
imagine that the dipole at infinity is compressed to an uncharged point particle; 
let — wo be the work supplied in this first step. No work is required to move the 
point particle through the electric field, to x, because it is uncharged. When the 
particle is at x, the charges ±q must be separated by dx, say by moving +q by 
dx but holding —q fixed; the work done is +wo against the interaction between 
the charges, plus — (gE) • dx = —p • E(x) against the electric field. Thus the total 
work is (3.110). 

The net electric force on the dipole is, by (3.110), 

F(x) = -Vt/ = V (p • E(x)). (3.111) 

In a uniform field there is a torque on the dipole, but no net electric force. If the 
field varies with position, there is a net electric force as well as a torque. The 
force equation can be written another way. Substitute E = —VV into (3.111); 
and, using the fact that p is constant, commute the V’s: 


F(x) = -V (p VV) = -p V (VV) 

= (P • V) E(x). (3.112) 

Whichever of(3.111)or(3.112)is more convenient may be used to calculate F. 

We may explain one of the most basic demonstrations of static electricity by 
(3.111). A piece of amber, rubbed with a wool cloth, becomes charged, and will 
attract small pieces of paper. 18 But the paper is electrically neutral. Why does a 
charge Q attract a neutral object? The reason is that the object becomes polarized 
in the electric field of Q, and so has a dipole moment parallel to the field. The 
energy U decreases (becomes more negative) as the object approaches Q, because 
both |p| and |E| increase. The force —Vt/ is in the direction of decreasing U, 
toward Q. 


3.9 ■ APPLICATIONS 

Applications of electrostatic fields range from large-scale ones using hundreds of 
kilowatts, e.g., precipitation of smokestack particles, to small-scale applications 
using about 1 watt, e.g., deposition of fibers on cloth or paper. The general prin¬ 
ciple in these applications is to put a charge on a particle in order to control its 
motion by an applied electrostatic field. 

18 Thales of Miletus (6th century BCE) is supposed to have discovered static electricity by observing 
that when amber is rubbed with animal fur, it attracts bits of feathers and the pith of plants. Thales is 
considered to be the first Western philosopher, and to him are attributed many early discoveries and 
ideas on science and mathematics. 
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Probably the most important industrial application is the removal of small solid 
particles from the efflux of smokestacks at large power plants. Cottrell electro¬ 
static precipitators, which remove soot, smoke, fly ash, and other particle pollu¬ 
tants, are used. In the usual design the particles to be removed are first charged 
by passing them through ionized air; the ions attach themselves to the particles or 
transfer their charges upon collision. The charged solid particles are then collected 
on planes or coaxial charged conductors and thus prevented from leaving with the 
effluent gases. A typical large industrial process uses an electrostatic field with 
voltage of order 100 kV and collection current of about 2 A. Essentially the same 
process, but at lower voltage and current, is also used for smaller smokestacks, in 
which the purpose is to recover valuable dusts from the flue gases. 

Another spectacular and commercially important application of electrostatic 
fields is in automobile painting. In practice liquid paint is fed onto a spinning, con¬ 
ducting disk or cone which is at a high voltage, say, 250 kV. The automobile body 
to be painted is nearby and kept grounded, i.e., at potential zero. Charged droplets 
spin off the disk by centrifugal force. The electric field between the charged disk 
and the automobile body exerts a force on the charged droplets toward the car, 
and deposits them there; a typical current is 5 mA. When operating properly, es¬ 
sentially all the paint is deposited on the car. 

Some smaller-scale, but also commercially useful applications are: (a) Deposit¬ 
ing the abrasive particles in sandpaper production; the electrostatic process has the 
desirable effect of tending to expose the sharp points of the particles; (b) Ink jet 
printers, which achieve remarkable control for depositing charged ink droplets 
on paper by using electrostatic fields; (c) Electrostatic fields used in depositing 
charged fiber-particles as flocking onto tee shirts, greeting cards, wallpaper, etc. 

In most such applications the force is on a charged particle, as in Eq. (3.6). One 
can also control the motion of uncharged polarized particles by using nonuniform 
fields, as in (3.111). However, this is more difficult because the force is weaker. 
Finally, it is important to remember that although electrostatic control of charged 
particles is straightforward in principle, these applications require sophisticated 
engineering in practice. 


3.10 ■ CHAPTER SUMMARY 

Electrostatics is concerned with three functions, p(x), E(x), and V (x). To proceed 
to the next few chapters, it is necessary to comprehend the six relations among 
these three functions. For each pair of functions there is an integral formula and a 
partial differential equation. 


1. Density and Field: 


E = 


47T60 




pdV 


and V-E = p/eo- 
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The electric field is determined by the density by the integral formula (3.15), 
which is rewritten in a concise form above. The partial differential equation, 
which must hold at every point in space, is Gauss’s Law. These are not inde¬ 
pendent principles. Either one may be derived from the other. 

2. Field and Potential: 


V(x) = — f E ■ dl and E = — W. 

Jxo 


The reason a static E may always be written as a gradient, and that the line 
integral is independent of the path from xo to x, is that V x E = 0. 

3. Potential and Density: 


1 

47T6 0 



- V 2 V = p/€ Q. 


The integral formula for V (x) is (3.60), rewritten in a concise form above. The 
local equation is Poisson’s equation. The integral is just Green’s solution to 
Poisson’s equation, discussed in Section 3.5. 


FURTHER READING 

The topics in this chapter, being basic, are covered in most textbooks of electricity 
and magnetism. For a student who wants to study other treatments of the subject, 
the books below are recommended. 

D. J. Griffiths, Introduction to Electrodynamics, 3rd ed. (Prentice-Hall, Upper Saddle River 
NJ, 1999). [A popular textbook with many example calculations.] 

J. R. Reitz, F. J. Milford, and R. W. Christy, Foundations of Electromagnetic Theory, 4th ed. 
(Addison Wesley, San Francisco CA, 1993). 

J. C. Slater and N. H. Frank, Electromagnetism (McGraw-Hill, New York, 1947). [A classic 
text that emphasizes physical insight.] 

R. H. Good, Classical Electromagnetism (Saunders, Fort Worth TX, 1999). 

P. Lorrain, D. R. Corson, and F. Lorrain, Fundamentals of Electromagnetic Phenomena 
(Freeman, New York, 2000). 

J. D. Jackson, Classical Electrodynamics, 3rd ed. (Wiley, New York, 1999). [The definitive 
graduate-level text.] 


EXERCISES 

Sec. 3.1. Coulomb’s Law 

3.1. Two protons interact, separated by distance d. Calculate the ratio of the electric force 
to the gravitational force for one of the protons. 
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3.2. The quantity 1 /4n€§ is the constant of proportionality between the electric force 
and qxq'i/r'*'■ The parameter 6 q is called the permittivity of the vacuum. Its value 
depends on the system of units being used. 

(a) Use dimensional analysis to determine the SI units of (q. 

(b) In SI units, 1 /4;reo is defined to be 10 _7 c 2 Ns 2 /C 2 where c is the speed of light. 
From this, evaluate (q. 

Sec. 3.2. The Electric Field 

3.3. A proton moves in a uniform electric field E()k. The proton is released at the origin 
with initial velocity vo i. What is the position of the proton at time /? 

3.4. For the charge configuration i n Example 1: (a) find the electric field along the y axis; 
(b) find the electric field along the z axis. 

3.5. For the line charge of Example 2, find the electric field along the z axis for z > t ■ 

3.6. Starting from the result of Example 3, equation (3.20): 

(a) Find E along the axis of a disk of radius R which has a constant surface charge 
density a. (Build up the disk from elemental rings, and integrate the contribu¬ 
tions from the rings.) 

(b) From the result of (a) find E(x) due to an infinite plane with surface charge 
density a. 

3.7. Charges +q and —q are located on the z axis at (0, 0,a) and (0,0, —a) respectively. 
Determine E(x) on the x axis, i.e., x = (x, 0, 0). Use computer graphics to plot 
E z (x) as a function of x/a. 

3.8. A semi-infinite wire lies on the negative z axis, from z — 0 to z = — oo, with 
constant linear charge density X. 

(a) Determine E at a point (0,0, z) on the positive z axis. 

(b) Determine E at a point (x, 0, 0) on the positive x axis. 

3.9. A long, uniformly charged ribbon is located in the xz plane, parallel to the z axis, 
occupying the region—oo < z < oo and—a/2 < x < a/2.The charge per unit area 
on the ribbon is a. 

(a) Determine E at (x, 0, 0), where x > a/2. What is the asymptotic field on the x 
axis? 

(b) Determine E at (0, y, 0), where y > 0. What is the asymptotic field on the y 
axis? 

Sec. 3.4. The Integral Form of Gauss’s Law 

3.10. There is a point charge Q at the origin. 

(a) Calculate the flux of E through a cube centered at the origin and aligned with the 
Cartesian axes. Evaluate the surface integral directly, and verify that the result 
agrees with Gauss’s Law. 

(b) Calculate the flux of E through a cube with one vertex at the origin, and with 
the rest of the cube in the octant with x, y, z > 0. Evaluate the surface integral 
directly. How does the result square with Gauss’s Law? 
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3.11. (a) Using Gauss’s Law in integral form, prove that the electric field outside 

a uniformly charged spherical shell, with total charge Q and radius a , is 
Qr/(47te 0 r 2 )- 

(b) Prove that the electric field inside the shell is 0. Note that these results are the 
same as obtained in Example 4 by integrating over the charge distribution. 

3.12. A charge +q is at (0, 0, zo). 

(a) What is E(x)? 

(b) By explicitly integrating E n (\) over the surface of a sphere centered at the origin 
and with radius R > zo, show that f E ■ n dA equals <yo/ e O as it must by Gauss’s 
Law. (This is not an easy exercise.) 

3.13. The integral and differential forms of Gauss’s Law are equivalent, because either 
one implies the other. Derive the differential form of Gauss’s Law from the integral 
form, by letting S be the surface of an infinitesimal cube. 

3.14. Two large plates are parallel, have uniform surface charge density +cr and —a, and 
are separated by distance d. 

(a) Determine the electric field between the plates far from the edges. 

(b) Determine the force per unit area on either plate, neglecting edge effects. (Hint: 
A charged plate does not exert a force on itself.) 

3.15. A hollow cylinder of radius R is uniformly charged with surface charge density a. 
Use Gauss’s Law to determine E for r < R and r > R. 

3.16. Two concentric thin spherical shells, with radii a and b > a, have uniformly dis¬ 
tributed charges +Q and — Q respectively. What is E(x) in the region a < r < bl 
How does E(x) change if the outer shell is discharged? 

Sec. 3.5. Green’s Function and the Dirac delta Function 

3.17. (a) Prove that S(ax + b)f(x) dx = f(—b/a)/\a\. 

(b) From (a) prove that S(x - y)f(y)dy = fix). 

(c) Prove that f 5 3 (Ax + b)/(x)d 3 .r = /(—A _l b)/|Det A|, where A is a 3 x 3 
matrix. (Hint: When you change variables from x to x', remember the Jacobian 
factor J in d^x = J d^x'.) 

(d) From (c) or (b) prove that f 5 3 (x — y)/(y)d 3 y = /(x). 

3.18. Consider d(x, e) = exp(—x 2 /e)/v^re. 

(a) Use computer graphics to plot d(x, e) versus x for e = 0.3, 1.0, and 3.0. 

(b) Show that lim f ._>o d(x,e) = 0 for x ^ 0. 

(c) Show that lim f _,od(x, e) =ooforx = 0. 

(d) Determine f^^dix, e)dx for arbitrary e. 

(e) Explain the meaning of the statement linif^o d(x, e) = Six). 

3.19. (a) In one dimension (x), let d a (x) be 1 /2 a for —a < x < a , and 0 for |x| > a. 

Prove that lim*, _>o d a (x) = 5(x). 

(b) In three dimensions let d a (x) be 1/ V for r < a, and 0 for r > a; here r denotes 
|x| and V = j7ra 3 . Prove that lim a ^.o^a( x ) = 
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(c) Consider a uniformly charged sphere with total charge Q and radius a\ that is, 
the charge density is p(x) = Qd a (x). Find the potential function VCx j for r < a 
and r > a. 

(d) Verify by explicit calculation that V(x) obeys Laplace’s equation (V 2 V = 0) 
forr > a and Poisson’s equation for r < a. 

(e) Sketch plots of V versus r, for several values of a, showing how V (x) ap¬ 
proaches Q/(47i(Qr) as a —*■ 0. 

Sec. 3.6. Electric Potential 

3.20. (a) Consider the potential function C • x where C is a constant vector. What is E(x)? 
(b) What is the potential V(x) for a uniform electric field of strength Eq and direc¬ 
tion k? 

3.21. Show that the electrostatic potential of a charged circular ring with total charge Q 
and radius a , on the axis of symmetry of the ring, is V(z) = Q/(4n€Qy/z 2 + a 2 ), 
where z is the distance from the center of the ring. From V (z) determine the electric 
field on the axis of symmetry. 

3.22. Use the integral (3.32) to determine the potential V(x) both inside and outside a 
uniformly charged spherical surface, with total charge Q and radius R. (Hint: Divide 
the sphere into rings, and use the result given in Exercise 21.) From V(x) calculate 
the electric field. Sketch plots of V (r) and E r (r) versus r. Notice that the potential 
is continuous at r = R, but the electric field is discontinuous because of the surface 
charge. 

3.23. Imagine a spherically symmetric charge density 

, ( Cr for r < a 

P(x) = | 0 for r > a. 

(a) Determine the electric field E(x) and potential V (r). Notice that V(r) and E(x) 
are continuous at r = a. 

(b) Now suppose additional charge i s placed uniformly on the surface at r = a, with 
surface density op- ( CT 0 has units C/m 2 .) Determine E(x) and V (r) for this case. 
E(x) is discontinuous at r = a, but V(r) is continuous. Explain why. 

3.24. Consider the field 

E = i (lx 2 - 2xy - 2 y 2 ) + j x 2 - 4.ry + y 2 ) . 

Is it irrotational? If so, what is the potential function? Also, calculate V ■ E. 

3.25. Find the restrictions on C i and Ci such that this function may be a potential function 
in a charge-free region of space: V(r, 9) — (C\ cos 2 6 + C' 2 )/r 2 . 

Sec. 3.7. Energy of the Electric Field 

3.26. Calculate the self-energy of a charged spherical surface, with total charge Q uni¬ 
formly distributed on the surface, and radius R. What is the physical significance of 
this result? What is the limit of the self-energy as R —> 0? 
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3.27. Evaluate the interaction energy U \2 = f() / El ■ E 2 d^x for two point charges sepa¬ 
rated by d. Ei is the field of q\ and E 2 is that of <p>- (Hint: U \2 depends only on d, 
so without loss of generality let <71 be at the origin, and qj on the z axis at z = d.) 

3.28. (a) Show that the electrostatic energy of a uniformly charged solid sphere, with total 

charge Q and radius R , is 3Q 2 /(20jt€qR). 

(b) Use (a) to compute the electrostatic energy of an atomic nucleus [charge = Ze, 
radius = (1.2 x 10 “ 15 m)A'/ 3 ] in MeV times Z^A 1 / 3 . 

(c) Calculate the change of electrostatic energy when a uranium nucleus (Z = 92, 
A = 238) fissions into two equal fragments. 

3.29. Calculate the energy per unit length for two long coaxial cylindrical shells, neglect¬ 
ing end effects. The inner and outer cylinders have radii a and b , and linear charge 
densities A and —A, uniformly distributed on the surface, respectively. 

Sec. 3.8. The Multipole Expansion 

3.30. Let V (z) be the potential of a ring of charge on the axis of symmetry at distance z 
from the center. (See Exercise 21.) Obtain the first two nonvanishing terms in V(z) 
for z 2> the radius of the ring. To which multipoles do they correspond? Can you 
see by symmetry that the dipole moment is 0 ? 

3.31. Charges ±q separated by distance d make a dipole with dipole moment qd. For 
large r the potential approaches qd cos 6/(Ane^r 2 ). For 9 = 0, how large must r be 
so that the asymptotic function is accurate to 1 %? 

3.32. A point dipole p = pk is at the origin. 

(a) At point Pj = (x\ , 0, 0) there is a point charge q. What is the force on this 
charge due to the field of the dipole? What is the force on the dipole? 

How much work is required to take the charge from P\ to infinity if the dipole 
remains fixed at the origin? How much work is required to take the dipole from 
the origin to infinity if the charge remains fixed at P\ ? What is the physics 
underlying the simple answers to these questions? 

(b) The point charge is now moved from P\ to another point P 2 = (.<.' 2 , >’ 2 , Z 2 ) 
while the dipole remains fixed. How much work is required to move the charge? 
What is the force on q when it is at P 2 ? 

3.33. The dipole p = Pi i + P 2 j + P 3 k is located at the origin. 

(a) What is the potential V (r, 9, <p) in spherical coordinates? 

(b) What is the electric field E (r, 9. </>)? 

3.34. A point dipole p = pok > s at the origin. A second dipole p = pqk is at (0, 0, zo)- 

(a) What is the force on the second dipole? 

(b) What is the interaction energy? [Hint: The energy is the work required to bring 
the second dipole to ( 0 , 0 , zq) with the first dipole fixed.] 

(c) Show that the energy is —p • E(0, 0, zo). 

[Answer: (a) F z = -3p^/(27re 0 Zo); (b) U = -po/(2jre 0 z 3 ).] 

3.35. Figure 3.16 shows three charged line segments, each with linear charge density A, 
extending from the origin O to (a, 0 , 0 ), from O to ( 0 , b, 0 ), and from O to ( 0 , 0 , c). 
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FIGURE 3.16 Three charged rods. 


(a) Find the dipole moment of this charge distribution. 

(b) Find the first two terms in the multipole expansion of the potential V (0, 0, z) on 
the z axis, for z 2> a, b, c. 

(c) What are the monopole and dipole contributions to E(0, 0, z) for z » a, b, cl 

3.36. Repeat the previous exercise (35) if the line charges are placed symmetrically around 
the origin, i.e., from (—a/2, 0, 0) to (+a/2, 0, 0), from (0, — 6/2,0) to (0, +b/ 2, 0), 
and from (0, 0, —c/2) to (0, 0, +c/2). 

[Answer: (b) V (0, 0, z) = X(a + b + c)/(4n(Qz) — k(a 3 + b 3 — 2c 3 )/(967reoz 3 )-] 

3.37. Derive (3.100), the electric field of an electric dipole, from the dipole potential 
(3.98). Calculate — V(p • x/r 3 ) either by using spherical coordinates or by the del 
identity for the gradient of a product. Sketch the electric field lines. 

3.38. A linear quadrupole consists of three charges: q , —2 q, and q, on the z axis. The 
positive charges are at z = ±d and the negative charge is at the origin. 

(a) Show that this system is the same as two dipoles, with dipole moments +qdit 
and —qd k, centered atz = +d/2 andz = —d/2, respectively. 

(b) Calculate the potential V (r, 9) in spherical coordinates for r d. 

(c) Sketch the electric field lines in the xz plane. 

3.39. Consider the electric quadrupole consisting of four charges in the xy plane: +q at 
{x, y) = (a, 0), -q at (0, a), +q at (—a, 0), and -q at (0, -a). 

(a) Determine the electric potential on the x axis. What is the asymptotic form of 
V(x, 0, 0)? Use computer graphics to make a log-log plot of V versus x and 
verify the asymptotic behavior. 

(b) Determine the electric field on the x axis for x » a. 

(c) Determine the electric field on the z axis. 

General Exercises 

3.40. Imagine three concentric charged hollow spheres, with radii R, 2R, and 3 R and 
charges +Q, —Q, and +Q, respectively. 
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(a) Determine the electric field at four points a, b, c, d; these are all on a radial 
line at distances 0.5 R, 1.5 R, 2.5 R, and 3.5 R from the center of the system, 
respectively. 

(b) Determine the ratio E^/E^. [Answer: 9/49] 

3.41. A large flat plate with constant surface charge density a lies in the xy plane. What 
is the potential difference between the points (0, 0, d) and (d. d, 2d), which are far 
from the edges of the plate? [Answer: —ad/(2 cq)] 

3.42. A solid sphere of radius R centered at the origin is uniformly charged with charge 
density p. Then a small sphere is removed, making a spherical cavity centered at 
a point a within the sphere. Show that the electric field in the cavity is pa/3eo- 
(Surprisingly, it is uniform.) 

3.43. A model for the electrostatic potential of an atom, due to the nucleus (charge +Ze) 
and electrons, is the so-called “screened Coulomb potential,” 


V(x) = 


Ze c -r/a 
dneiyr 


in which a is an effective atomic radius and r = |x|. 

(a) What is the charge density p(x)? (Hint: If you calculate — V 2 V naively, e.g., us¬ 
ing spherical polar coordinates, you will get the charge density of the electrons, 
but not the nuclear charge density, which is a delta function at x = 0. Note that 
V diverges as 1 /r as r -* 0, which by Gauss’s Law implies a point charge at 
the origin.) 

(b) Sketch a graph of p versus r. 

(c) Show that the total charge is 0. 

3.44. Determine the potential on the axis of symmetry of a uniformly charged disk. Ex¬ 
pand the result in the region of z much larger than the disk’s radius and show that 
the leading term is Q/(4neQz), as expected, where Q is the total charge on the disk. 

3.45. For a uniformly charged line, determine E(x) in two ways: by integrating over the 
charge distribution, and by applying Gauss’s Law. 


Computer Exercises 

3.46. We can use numerical integration to evaluate the potential V(x) for a given charge 
distribution p(x), when p(x) is too complex for an easy analytic calculation. For 
example, consider a uniformly charged wire in the form of one turn of a spiral in the 
xy plane. The equation for the spiral is, in plane polar coordinates, r(G) — a(\ — 
6/4n), for 0 < 9 < 2n. At 9 = 0 the radius is a; at 0 = 2 n it is a/2. Find 
the potential at the origin. Use a computer program to evaluate the integral over the 
charge distribution numerically. 

3.47. Use a computer program to make a contour plot, for points in the xy plane, of the 
electric potential of a system of three point charges. Let the charges be q, 2q, and 
3 q, located at the vertices of an equilateral triangle in the xy plane: (a/2, 0, 0), 
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(—a/2, 0,0), and (0, \fia/2, 0) respectively. The potential is 


V(x, y, 0) 


<? 

4neoa 


fix, y) 


where / ( x , y) is a dimensionless function; set a = 1 to make a the unit of length. 
For example, some graphics programs have a contour plotting command, which can 
be used to plot the contours of f (x,y). These are the equipotentials of the system. 

3.48. Consider a dipole, consisting of charges ± 1 pC at positions (0, 0, ± 1 cm) respec¬ 
tively. The dipole moment is 2 x 10 -8 Cm. Use a computer program to plot the 
electric field strength E(x, y, 0) for points in the xy plane, as a function of r = 
\Jx~ + y 2 . Show on the same plot the electric dipole approximation for E(x, y, 0). 
For what range of r is the dipole approximation accurate to 10%? 


3.49. A wire is bent into a circular ring of radius R and given total charge Q. The cross 
section of the wire is negligible. Let the ring lie in the xy plane (z = 0) and use 
spherical polar coordinates (r, 8, </>) forpoints x off the ring. By rotational symmetry 
about the z axis, the potential V (x) is independent of the azimuthal angle </>; that is, 
V = V(r, 8). The potential on the axis of the ring (6 = 0) was calculated in Exercise 
21. To calculate V off the axis requires a computer. 

(a) Show that the potential function is 


V(r, 8) = 


Q 1 r 2x d<p' 

47re() 2rr Jq r 2 + /? 2 — 2rR sin 9 cos <p' 


(b) The (p r integral can be evaluated numerically, e.g., with a computer program. A 
reasonable approximation is to replace the integral by a sum 


V(r, 8) 


Q 1 

47T€q N 


N 


E 


_i_ 

y/r 2 + R 2 — 2rR sin# cos(2nn/N) 


where A is a sufficiently large integer. For r — 2R plot V(r, 8) as a function of 
8, and explain the dependence on 8 intuitively. (The domain of 8 is (0, 7r).) Use 
Q/(47te(jR) as the unit of potential. 

(c) For r = R/2 plot V (r, 8) as a function of 8, and explain the dependence on 8 
intuitively. 
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The characteristic feature of a conductor is that some of its electrons are free to 
move macroscopic distances. These conduction electrons come from the atoms of 
the material, so there is also a lattice of positive ions, which move only by small 
vibrations about their equilibrium positions. In the absence of an electric field the 
free electrons are uniformly distributed and in rapid random motion, interacting 
with the lattice and each other. But if an electric field is applied to a conducting 
body, the free electrons inside will move in response to the electric force until a 
new equilibrium is reached, which happens when E = 0 inside the conductor. 
The field outside the conductor is then different from the original field because 
it includes the Coulomb-field contributions of the displaced electrons. For this 
reason conductors play an important role in electrostatics. 

To understand the physics of the approach to equilibrium, picture a conductor— 
to be specific let it be an uncharged sphere centered at the origin—to which is 
applied a uniform field E = £k in the +z direction. Electrons in the conductor 
are subject to the force F = — eE, so the free electrons move in the —z direction. 
They will accumulate somewhere on the conductor, displaced in the — z direction 
from their original positions. A nonzero charge density develops, with a field that 
tends to cancel the applied field inside the conductor. The electrons stop moving, 
i.e., electrostatic equilibrium is reached, when the vector sum of all field contribu¬ 
tions is zero everywhere in the conductor. We will solve this specific problem in 
Sec. 4.3, find that the displaced charges accumulate on the surface of the sphere, 
and calculate the surface charge density. 

We usually think of conductors as metals. On a microscopic scale the conduc¬ 
tion electrons inside a metal are in the periodic potential of a lattice of positive 
metallic ions. Their freedom to move, in other words, the high electrical conduc¬ 
tivity of metals, can be explained by quantum mechanics. In a typical metal the 
conductivity is large, so the electrostatic equilibrium discussed in the previous 
paragraph is achieved in a very short time. 1 For example, in Cu the characteristic 
time for reaching equilibrium is about 10 -19 s. Therefore, except for processes 
at very high frequencies, of order 10 19 Hz and higher, equilibrium may be con¬ 
sidered to be instantaneous. Semiconductors, e.g., graphite, also have conduction 
electrons, so their electrostatic behavior is similar to metals. There is active ex¬ 
perimental and theoretical research on electrical properties of metals and semi¬ 
conductors, about which much is understood, but also a lot is not yet understood. 

1 See Sec. 7.6. 
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4.1 ■ ELECTROSTATIC PROPERTIES OF CONDUCTORS 

In this section we describe some general properties of conductors in equilibrium 
with static electric fields. These results will be used in the analysis of specific 
calculations later. In each case we’ll state the property, and then explain the theory 
behind the statement. 

The Electric Field Is Zero Inside a Conductor in Electrostatic Equilibrium. If 
E were nonzero then free charges would accelerate and the system would not be 
static. 

The Electric Potential Is Constant Inside a Conductor in Electrostatic Equilib¬ 
rium. Recall that the potential difference between two points is the work per unit 
charge required to move a test charge from one to the other against the electric 
force. But in a conductor E is zero, so there is no force on the test charge. No 
work is required to move the charge, the potential difference between any two 
internal points is zero, and we have V(x) = Vo, a constant. In other words, the 
entire conductor is at potential Vo. The surface of the conductor, which is also at 
potential Vo, is an equipotential surface. 

The Charge Density Is Zero Inside a Conductor in Electrostatic Equilibrium. 
Gauss’s Law states that the flux of electric field through any closed surface is 
proportional to the charge enclosed by the surface. If a Gaussian surface lies en¬ 
tirely inside a conductor, then the flux is 0 because E = 0. Thus the net charge 
contained in any volume inside the conductor is 0. In other words, p(x) =0 in¬ 
side the conductor. Any excess charge on a conductor must therefore reside on its 
surface, and must be described by a surface charge density a. 

Consider this question: If electrons (negative charge) appear on the surface of 
an uncharged conductor in an electric field, how can there then be zero charge 
density inside the conductor? Where do the electrons come from? The answer is 
that the electrons can be considered as coming from elsewhere on the surface of 
the conductor. The surface charges have been redistributed. An equivalent amount 
of positive charge must appear at surface points from which the electrons moved. 
We may therefore speak freely of flow of either negative or positive charge. 

Either the Net Charge, or the Potential, of a Conductor in an Electrostatic 
System Can Be Specified. An isolated conductor may carry on its surface a net 
charge, negative or positive, or it may be uncharged. In an experiment the charge 
on a conductor is set by adding or removing electrons. If the conductor is then 
placed in an external field, or near other charges or conductors, its total charge 
stays the same, although the surface charge distribution changes so as to maintain 
E = 0 inside the conductor. In such an experiment we specify that the total charge 
on the conductor is fixed at some value Qo. However, its potential and the surface 
charge distribution are determined by the other charges and conductors in the 
system. 
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Instead of specifying the charge on a conductor we may specify its potential. 
For example, a conductor connected to the Earth is said to be at zero potential, 
or grounded. The idea is that the Earth is a very large conductor, defined to be at 
zero potential itself, and it serves as a source or sink of electrons without its own 
potential being changed. In an experiment a conductor can be set to any specified 
potential Vo by connecting one terminal of an appropriate dc-voltage source, say 
a battery, to the Earth and the other terminal to the conductor, as illustrated in 
Fig. 4.1. If the conductor is now placed in an external field, or near other charges 
or conductors, it will remain at the specified potential, although charge may be 
exchanged through the voltage source between the conductor and the Earth. The 
charge will distribute itself over the surface of the conductor so as to maintain 
E = 0 in the conductor. In such an experiment we specify that the potential of the 
conductor is fixed at Vo. However, its net charge and surface charge distribution 
are determined by the other charges and conductors in the system. 

At the Surface of a Conductor, E ta ngentiai = 0 and Enormai = cr/e o- The surface 
of a conductor in electrostatic equilibrium is an equipotential, as we saw above, 
so no work is required to move a charge along the surface; i.e., the tangential 
electric force is 0. Recall from Sec. 3.3.2 that the tangential components of the 
electric field are continuous across any surface (conducting or insulating) because 
V x E = 0. Therefore E f is 0 just inside, just outside, and on the surface of a 
conductor. Indeed, if E f were not zero on the surface, then there would be a flow 
of surface charge rather than static equilibrium. 

However, in general there is a normal component E n , i.e., perpendicular to 
the surface of a conductor. We can determine E n by using the integral form of 
Gauss’s Law. Draw a cylindrical Gaussian surface, with infinitesimal area dA and 
infinitesimal height, straddling the boundary surface as shown in Fig. 4.2. The 
electric flux across the flat surface inside the conductor is zero and that across 
the flat surface outside is E n dA. The charge enclosed by the Gaussian surface is 
ad A, where a is the local surface charge density. Therefore, Gauss’s Law implies 

E n =a/e 0 . (4.1) 

This is a special case of the boundary condition for E n discussed in Sec. 3.3.2. 
Equation (4.1) is an important relation because we will use it to calculate the 



FIGURE 4.1 A conductor held at potential Vq. 
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FIGURE 4.2 Gaussian surface at a conducting surface. 


surface charge density on a conductor from the field. Alternatively, in terms of the 
potential function, a = —eon • V V because E = — VV. 

There is an outward force per unit area on the surface of a conductor, given by 



F/A may be called the electrostatic pressure. The ultimate origin of this force 
is the Coulomb repulsion between like charges. To calculate the electric force on 
charges at the surface is subtle. Naively the force on a small area A of the surface 
is QE = a AE. But E is discontinuous at the surface: It is 0 inside and (cr/eo)n 
outside. Which value should we use? One way to justify (4.2) is to realize that 
the field (4.1) is due partly to the local surface charges themselves, i.e., those 
in the immediate area, and partly to all the other charges on the conductor. But 
in calculating the force on the local surface charges in a small area, we should 
not include their own contribution to the field because that would be a self force. 
The contribution from local charges is E n (local) = a/2eo by (3.46), because 
near the surface the local charge density looks like an infinite plane of charge. 
The contribution to (4.1) from the other charges must be E n (nonlocal) = a/2eo- 
Therefore F/A = oE n (nonlocal) = a 2 /2eo- 

A second way to understand (4.2) is by an energy argument. Suppose the small 
area A is moved outward by distance 81. The work done by the electrostatic force, 
F n 8l, would be —8U where U is the electrostatic energy. The energy density is 
2^0 E 2 , and is nonzero only in the region outside the conductor, because E = 
0 inside. As A moves outward, the volume with nonzero energy density would 
decrease by A81 , so the energy would decrease, with 

SU = -j€ 0 E 2 A8i 


where E = a/e o- Equating F n = —8U/8i leads to (4.2). 
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The Potential V (x) throughout Space Is Determined (Uniquely up to an Addi¬ 
tive Constant) When (a) The External Charge Densities, and (b) Either the Poten¬ 
tials or the Total Charges of All Conductors, Are Specified. The physical idea here 
is that once all the available physical parameters of an electrostatic system have 
been set, then the fields are determined by the laws of nature. This is a statement 
of the uniqueness theorem. The practical importance of the uniqueness theorem is 
that when a potential function satisfying the equations and boundary conditions 
has been found, by any method, then we need look no further. Any other func¬ 
tion satisfying these conditions can only differ by an additive constant. Potential 
differences and the electric field are unique; these experimental observables are 
not affected by an arbitrary constant in V (x). We will frequently appeal to the 
uniqueness theorem in solving problems. 


Laplace's Equation, Boundary Conditions, and the Uniqueness Theorem 
The potential function V (x) in a region of space where the charge density is zero, 
satisfies the partial differential equation V 2 V = 0, which is called Laplace’s 
equation. In a system of charged conductors, V(x) satisfies Laplace’s equation 
outside the conductors, and certain boundary conditions on the conductor sur¬ 
faces. If the potential of a conductor is specified to be Vo, then the boundary 
condition is V (x) = Vo, which is an example of a Dirichlet boundary condition. 
If the charge density on a surface is specified, then the boundary condition is 
—n • VV = cr/eo, which is called a Neumann boundary condition. It is a classi¬ 
cal theorem of mathematical analysis that Laplace’s equation, with a complete set 
of Dirichlet and/or Neumann boundary conditions, has a unique solution. 2 If the 
net charge of a conductor is specified we may not know its distribution until the 
problem is solved. However, we do know that V (x) is constant on the surface, and 
from the specified charge the value of the surface potential will be determined. 

As an example of the use of the uniqueness theorem, we shall prove this in¬ 
teresting result: The electric field is zero in any empty cavity in a conductor in 
electrostatic equilibrium, regardless of the electric field outside the conductor. 
(See Fig. 4.3.) To prove the claim, we merely note that the potential function 



FIGURE 4.3 A cavity in a conductor. Si is an equipotential surface just inside the cavity. 

2 A mathematical proof of the uniqueness theorem for Laplace’s equation with Dirichlet and Neumann 
boundary conditions is given in Chapter 5. 
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V(x) = Vo, where Vo is the constant potential of the conductor, does satisfy 
Laplace’s equation in the cavity and does have the proper value on the surround¬ 
ing surface. Hence, by the uniqueness theorem, Vo must be the potential function 
throughout the cavity. The field in the cavity is then E = — Wo = 0. A conductor 
shields a cavity in its interior from any external electrostatic fields. 

It is worthwhile to analyze this result in more detail. What happens if the con¬ 
ductor is charged? We will prove: If a charged conductor has within it an empty 
cavity, the field in the cavity is 0 and the charge on the cavity wall is 0. We already 
know that the net charge of a conductor must lie on its surface. The claim here is 
that it lies entirely on the outer surface. 

Rather than just appealing to the uniqueness theorem, let’s now analyze the 
potential physically. Let Vo be the potential of the conductor, and Vi the potential 
of an equipotential surface S\ located just inside the cavity, illustrated in Fig. 4.3. 
If Vi > Vo, then E points outward at all points of Si . But that leads to a contra¬ 
diction, because Gauss’s Law implies there is a positive charge enclosed by Si, 
contradicting the statement that the cavity is empty. So Vi cannot be larger than 
Vo; and similarly it cannot be less than Vo because that would imply a negative 
charge enclosed. Therefore Vi = Vo, and by extension the same is true for any 
equipotential surface in the cavity. Hence V (x) = Vo and E(x) = 0 in the cavity. 
Now consider a Gaussian surface in the shape of a small pill box straddling the 
wall of the cavity. Since E = 0 on both sides of the wall, the flux through the pill 
box is 0, and by Gauss’s Law the charge on the wall must be 0. 

This result, that E = 0 inside the cavity, is ultimately a consequence of 
Coulomb’s Law F oc 1/r 2 , i.e., that the force between charges varies inversely 
with the square of the distance. We derived the result from Gauss’s Law, but 
Gauss’s Law and Coulomb’s Law are equivalent in that either one implies the 
other. If the electric force were proportional to r~ p with p f 2, then Gauss’s 
Law would not be true, and E would be nonzero in the cavity. 

The most sensitive experimental tests of the inverse square law (p = 2) have 
come from experiments that attempted to detect an electric force inside a cavity 
in a charged conductor. Cavendish 3 did one of the early experiments, in which 
a metal sphere was surrounded by a pair of hemispherical metal shells, and con¬ 
nected to the shells by a wire. Then charge was placed on the outer shells. If E 
were nonzero inside the conducting shell, some free charge would flow to the 
metal sphere inside. The hemispheres were then disconnected and removed, and 
the charge of the metal sphere was measured. This and all later experiments had a 
null result. Knowing the precision of the experiment, the null result sets an upper 
bound on Ip — 2|, shown for several experiments in Table 4.1. 4 


Comment on General Electrostatics Problems 

The most general electrostatics problem involving charges and conductors is to 
find E(x), or equivalently V(x), for an arbitrary collection of charges and con- 


2 Henry Cavendish, b. 1731, d. 1810. 

4 In quantum electrodynamics the inverse square law F <x 1/r 2 means that photons have no mass. 
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TABLE 4.1 Tests of Coulomb’s Law 


Experimenters 

Date 

|p — 2| bound 

Cavendish 

1773 

< 2 x 10“ 2 

Maxwell 

1873 

< 5 x10~ 5 

Plimpton and Lawton 

1936 

< 2 x 10 -9 

Bartlett, Goldhagen and Phillips 

1970 

< 1.3 x 10-' 6 

Williams, Faller and Hill 

1971 

< 1 x 10~ 16 


E.R. Williams, J.G. Faller, and H. Hill, Phys. Rev. Lett. 26, 721 (1971). 

A.S. Goldhaber and M.M. Nieto, “The mass of the photon”, Sci. Am. (May, 1976). 


ductors. One important source of difficulty in such problems is that we do not 
know a priori the charge distribution a (x) on the surfaces of the conductors; in¬ 
deed part of the problem is to determine a (x) from the conditions of the system. If 
the charge distributions were known then the problem would reduce to calculating 
the integral in (3.15) or (3.60). What we do know is E = 0 inside the conductors. 
That information together with the partial differential equations must somehow 
be used to find the complete field. An electrostatics problem is a boundary value 
problem. In the examples and exercises of this chapter we will consider some 
special, highly symmetric problems, that can be solved analytically. It should be 
understood, however, that problems of a more general form can be very difficult, 
and often require the use of numerical methods for their solution. In Chapter 5 we 
will develop some general methods for boundary value problems in electrostatics. 


4.2 ■ ELECTROSTATIC PROBLEMS WITH RECTANGULAR SYMMETRY 

General electrostatics problems, such as those that are important in applications, 
are often unsymmetrical; the conductors and charge distributions may have a com¬ 
plicated geometry. Such difficult problems must be solved by numerical methods, 
or by building and measuring analog models, or by other advanced techniques. In 
contrast, in the next sections we discuss some simple systems possessing symme¬ 
try, for which analytic solutions can be found. By studying such systems, which 
can be analyzed in detail, we will gain insight into the whole subject that will 
help to understand more complex systems. We’ll consider rectangular symmetry 
in this section and spherical and cylindrical symmetry in later sections. 

4.2.1 ■ Charged Plates 

As a start, consider two plane conducting plates 1 and 2 separated by a vacuum 
gap, as shown in Fig. 4.4. The lower conductor is grounded, i.e., V\ = 0, and the 
upper conductor is maintained at some positive potential Vo, i.e., V 2 = Vo > 0. 
We ask: What are the potential, field, and surface charge densities? In Fig. 4.4 the 
plates have thicknesses d\ and d 2 , but for the present we are concerned mainly 
with the boundary planes z = 0 and z = d. We assume the plates each have a large 
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FIGURE 4.4 Parallel plate capacitor. The dashed cylinder is a Gaussian surface used in 
the analysis. 

area A, and by large we mean that the lateral dimensions are much larger than the 
separation. Then there is a symmetry—translation invariance in directions parallel 
to the plates. The quantities we seek will not depend on the lateral coordinates, 
x or y. The potential and electric field will have the forms V (x) = V (z) and 
E(x) = £ z (z)k. 

In the gap the potential must be a solution of Laplace’s equation, which for 
this case is V 2 V = d 2 V/dz 2 = 0, satisfying the boundary conditions U(0) = 0 
and V(d) = Vo- The solution, which is unique, is clearly V (z) = Voz/d. The 
corresponding field, obtained from E = — VV, is £ z (z) = —Vo/d. The surface 
charge densities are obtained from (4.1), in which n is the outward normal at each 
surface. At the upper surface of Conductor 1 (z = 0) we have cr\ u = —eo Vo/d; 
and at the lower surface of Conductor 2 (z = d) we have 021 = +eo Vo/d. There 
are no surface charges on the other surfaces (z = d+d 2 or z = —d\ ); this fact will 
become clear in Example 1 following. Thus the total charges on the conductors 
are equal and opposite, Q\ = —eoVoA/d and Qi = +eoEo A/d. 

Notice that inside the conductors there is no electric field. Of course this is 
a condition for electrostatic equilibrium, but it is also a consequence of Equation 
(3.46), for the field of a planar charge. That is, the plane of surface charge at z = 0 
contributes a field E z = — Vo/Id for all z > 0, and the plane of surface charge at 
z = d contributes a field E z = + Vo/2 d for all z > d. Thus E = 0 for all z > d, 
in particular inside Conductor 2. Likewise E = 0 for all z < 0, in particular inside 
Conductor 1. In the conductors the fields from the two inner surfaces cancel; in 
the gap they add. 

The Parallel Plate Capacitor 

Two conductors that carry equal but opposite charges ± Q form a capacitor. The 
capacitance of the system, which depends only on the geometry, i.e., on the sizes, 
shapes, and relative position of the conductors, is defined as 



(4.3) 
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where V is the voltage difference between the conductors. 5 The unit of capaci¬ 
tance is the farad (F), defined by 1 farad = 1 coulomb/volt; or, 1F = 1C/V. The 
conductors in Fig. 4.4 make up a parallel plate capacitor, whose capacitance is, 
by the previous results, 


C = 


€0 A 
d ' 


(4.4) 


In stating the problem at the start of this section we implied that the conductors 
were charged by connecting one to a battery and the other to ground. We could 
instead have considered the plates to be isolated and provided with charges of ±Q. 
That system would be equivalent to Fig. 4.4, except for a constant added to the 
potential. The potential difference between the plates would be V = Qd/(eoA ) = 
Vo, the same as before, and the electric field and surface charge densities would 
also be the same. 


EXAMPLE 1 The previous discussion shows that if a potential difference Vo is 
established between two, originally uncharged, conductors, by connecting them 
to opposite electrodes of a battery, then they obtain equal but opposite charges. 
What the battery does is transport electrons from one conductor to the other. In 
Fig. 4.4 the net charge accumulates on the surfaces facing the gap. However, if 
the conductors are isolated, from each other and from ground, then they can have 
arbitrary charges. In this example we ask the general question: How is the charge 
distributed for arbitrary charges Q' and Q" on the two plates? 

So, now suppose the plates in Fig. 4.4 are isolated, with total charge Q' on 
Conductor 1 and Q' on Conductor 2. What will the charges be on each of the 
four surfaces? The four quantities that we seek are Q\ u and Qu, the charges on 
the upper and lower surfaces of Conductor 1, and Qi u and Qu, the corresponding 
quantities for Conductor 2. 

These charges may be determined from the following four equations: 


Qiu + Qu = Q ', 

(4.5) 

Qiu + Q21 = Q" 1 

(4.6) 

Qiu Qu _ „ 

A A ~ 

(4.7) 

Qu Qiu _ „ 

A A ~ 

(4.8) 


The first two equations express conservation of charge. The third equation is a 
consequence of Gauss’s Law applied to the cylindrical surface shown dashed in 
Fig. 4.4. The electric flux is 0 so the charge enclosed is 0, which is (4.7). This 
equation implies Q\ u = — Q 21 , i.e., that the charges on the inner surfaces are 

5 A safer, but clumsy, notation would be to write the voltage difference as A V, so as not to confuse it 
with the absolute potential, whose value is only determined up to an additive constant. 
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equal but opposite. The fourth equation, which implies Q\i = Q 2u , i.e., that 
the charges on the outer faces are equal, is true because E z must be 0 inside 
the conductors: The field in either conductor is determined by the contributions 
of the four planes of charge, each of which gives E p i an e = ncr/2eo- Since the 
contributions of the two inner surfaces are equal but opposite, the contributions 
of the outer two surfaces must also be equal but opposite to have E = 0, which 
requires Q v = Q 2u . 

The solution to the four equations is 

Qu = Qiu = (Q' + Q")/2 (4-9) 

Qm = -Qu = ( Q ' - G")/2. (4.10) 


There is an interesting symmetry, in that the charges on the outer surfaces are 
equal, with each carrying half the total charge in the system. The field in the three 
regions outside the plates is 


E z (z) = 


(Q' + Q")/(2eoA) for z > d + d 2 
(Q'~ Q")/(2eoA) forO <z<d 
~(Q' + Q")/(2 €oA) for z < —d\ 


(4.11) 


These field strengths are independent of the thicknesses d, d \, and d 2 . If Q" = 
— Q' then we recover the previous results of a parallel plate capacitor. 


Energy of a Capacitor 

It is clear that energy is stored in the electric field of a capacitor, because electrical 
work is done by a battery in charging a capacitor, and mechanical work can be 
extracted from the capacitor by letting the two conductors, which attract each 
other, come together. The total electrostatic energy U in a system of charges and 
fields was determined in Chapter 3 to be 



Using this general expression we may express the energy in a capacitor in terms 
of C, Q, and V. 

Consider a capacitor with arbitrarily shaped conductors at potentials V\ and 
V 2 and with respective total charges —Q and +Q. Such a generalized capacitor 
is shown in Fig. 4.5. For clarity assume there is no external field or other charges 
anywhere. To find the energy we use (4.12) by substituting the surface charge 
element a(x)dA, where cr(x) is the surface charge density on the conducting sur¬ 
faces of the capacitor, in place of the volume charge element p(x)d i x. That is, 
U = j f oVd A. The integral is over the surfaces of both conductors. Since V is 
constant on either surface, the integral over the first conductor is U\ = —QV\/2, 
and that over the second conductor is U 2 = + Q V 2 /l. Thus the total electrostatic 
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FIGURE 4.5 A generalized capacitor, consisting of two conductors with equal but op¬ 
posite charge. 


energy is U = U\ + I /2 = Q V /2, where the positive quantity V = V 2 — V\ is the 
difference of potential between the conductors. For any capacitor, 

U = l -QV = \cv 2 = (4.13) 

2 2 2C 

Another way to derive the energy stored in a capacitor, from first principles, is 
to calculate the work done to separate the charge + Q and — Q. To move a small 
amount of charge d Q' from the conductor with charge — Q' to the conductor with 
charge +Q' requires work dW = V dQ 1 , where V = Q'/C is the potential 
difference. The total work to accumulate charges ± Q on the two conductors is 

f Q Q'dQ' Q 2 

u -Lnr-$c- (4 ' 14) 

which is the same result as (4.13). 

Equation (4.13) can be used to find the capacitance of two conductors whose 
shapes may be irregular. The method is: (a) Find the potential V (x) between the 
conductors; (b) Calculate E = — VV; (c) Evaluate the total field energy with 
(3.84) and equate the result to one of the expressions in (4.13). This method is 
sometimes more convenient for calculating C than using the definition Q / V. 

4.2.2 ■ Problems with Rectangular Symmetry and External Point Charges. The 
Method of Images 

In the previous section we considered systems with charged plates, or with plates 
at given potentials. Next we consider problems with external point charges as 
well. In general these are more difficult because the external charges influence the 
charge distributions on the conductors and can affect the potentials. Furthermore, 
the system is no longer translation invariant in any direction, because the position 
of the charge is a preferred point in space. 

The classic, simplest example of this kind is a point charge at distance zo from 
a grounded plane. The arrangement is shown in Fig. 4.6(a), in which the charge 
+q is fixed at the point (0, 0, zo) and the grounded plane is z = 0. Formally 
the potential V(x, y, z) we seek is the solution in the region z > 0 to Poisson’s 
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FIGURE 4.6 (a) Point charge over a conducting plane; q is the real charge and —q is 
the fictitious image charge, (b) Electric field lines (solid curves) and equipotentials (dashed 
curves) for this system. 


equation (3.66) with p(x) = qS(x)S(y)S(z — zo), satisfying the boundary condi¬ 
tion V{x, y, 0) = 0. For z < 0, i.e., inside the conductor, we know a priori that 
V = 0. 

However, it is not necessary to proceed through a formal solution of the bound¬ 
ary value problem, because we can use a clever trick, called the method of image 
charge, to determine the potential. It is easy to see that if we construct V(x, y, z) 
as the sum of the potential of the charge +q plus the potential of a fictitious image 
charge —q located at (0, 0, —zo), we will have the potential function that we seek. 
The potential for these two charges is 

_1_ _1_" 

y/x 2 + y 2 + (z - zo ) 2 \/x 2 + y 2 + (z + zo ) 2 

(4.15) 

It is important to understand why this construction satisfies all the equations. The 
boundary condition is satisfied; that is, V (x, y, 0) = 0. This is obvious mathemat¬ 
ically, from (4.15); physically, in the image system the plane z = 0 is equidistant 
from the charges +q and —q so its potential is 0. Also, Poisson’s equation is satis¬ 
fied in the region z > 0 because the second term in (4.15) is equivalentto a source 
in the region z < 0, so contributes nothing to V 2 V for z > 0. The corresponding 
field E = — W is 

xi + yj + (z - zp)k _ xi +yj + (z + zp)k 

(x 2 + y 2 + (z - zo) 2 ) 3/2 (x 2 + y 2 + (z + zo ) 2 ) 3/2 

(4.16) 

We emphasize that (4.15) and (4.16) apply only in the region z > 0. In the region 
z < 0, V and E are 0. 

Next we analyze the charge on the plate. We shall find that when q is placed at 
(0, 0, zo), an equal but opposite charge is drawn onto the conductor from ground. 


E (x,y,z) - 


47T60 


V(x, y, z) = 


47T60 
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The net charge of the conductor lies on the surface z = 0. We wish to determine 
the surface charge density a. By symmetry, er is a function only of r = Jx 2 + y 2 . 
According to (4.1) it is a(r) = eo E„(r) = eo E z (x, y, 0) where E n is the field 
component along the outward normal of the surface. Using (4.16) to calculate the 
normal component on the surface, we see that the two terms contribute equally 
to E z (x, y, 0) because the real charge and image charge are equidistant from the 
surface. The result is 


cr(r) = 


-qz o 

2n (r 2 + zlf 2 


(4-17) 


It is left as an exercise 6 to show that the total charge induced on the plane 
/ 0 °° a{r)2nrdr is —q. Figure 4.6(b) shows the electric field lines originating 
at +q and terminating on the conductor, as solid curves. The equipotentials are 
shown as dashed curves. 

A point charge near a grounded conducting plane experiences a force toward 
the plane. We leave it as an exercise 7 to calculate the force on the charge. 

It is important to emphasize that there is no charge at z = — zo- We know that 
the charge density inside a conductor in electrostatic equilibrium is 0. The electric 
field is actually the vector sum of the Coulomb fields for all real charges, namely, 
the point charge +q and the surface charge of (4.17). For z > 0 these fields sum to 
give (4.16), and for z <0 they sum to give E = 0. Introducing the image charge 
at z = — zo is just a mathematical trick. 

Figure 4.6 and the previous discussion can be extended to a technique for find¬ 
ing the electric field for any charge distribution above a grounded conducting 
plane. The underlying principle is the same as for the Green’s function: If we 
know the potential for a point charge then we can construct the potential for a 
distributed charge by a superposition integral. This idea leads to the general rule 
that if p(x, y, z) is the real charge distribution (for z > 0), then the appropriate 
image distribution in the region z < 0 is — p(x, y, — z). For example, if a charged 
line or line segment lies above the grounded plane, then the image charge is its 
miiror image charged oppositely. An interesting case is a uniformly charged infi¬ 
nite line parallel to the surface. In that case, as we will see in Sec. 4.4 when we 
treat cylindrically symmetric systems, the equipotential surfaces are non-coaxial 
cylinders. 


EXAMPLE 2 Figure 4.7 shows the case of an electric dipole p = po j located 
at the point (0, 0, zo) above the grounded plane z = 0. What is the asymptotic 
electric field on the z axis for z zo? 

The image charge configuration that makes V = 0 on the plane z = 0 is an 
image dipole p' = — po j located at the point (0, 0, —zo)- Equations (3.99) and 
(3.101) give the potential and field of a dipole. Proceeding from (3.101), note that 


6 See Exercise 6. 

7 See Exercise 7. 
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FIGURE 4.7 Example 2. An electric dipole over conducting plane, p is the real dipole, 
and p' is the fictitious image. 


p r = 0 and p' • r = 0 for points on the z axis, so 


E(0,0, z) = E y {z)\ 


Poj 

47TC0 


1 

(2 - Zo) 3 


+ 


1 

(z + zo) 3 




(4.18) 


where the contributions in the square brackets are from the real and image dipoles, 
respectively. The leading nonzero term in the expansion for z zo is E y (z) = 
— 3pozo/(2neoz 4 ). Not surprisingly, the far field is that of a quadrupole, because 
the total charge and total dipole moment are both zero for the combined system 
of real dipole plus image dipole. 

Could the field have been obtained by calculating V (x) and using E = — VV? 
The answer is that to find an expansion for E(x) at arbitrary x it would be easier to 
proceed by using the potential—a scalar—rather than the vector field. However, 
the potential is identically 0 at all points on the z axis, so to find E(0, 0, z) it is 
easier to compute the field itself as we have done. 

In this example we could have considered the real dipole to be the limit of two 
individual charges ±q separated by a small distance. The image dipole would 
then be the combination of the images q of the real charges. 


An interesting problem involving multiple images is shown in Fig. 4.8. A real 
charge q\ = +q is at (0, yo, Zo), and the mutually perpendicular planes y = Oand 
z = 0 are both grounded. The image charge that is required to set the plane y = 0 
to potential zero is <72 = at (0, — yo, zo) ■ The image charges then required to 
set the plane z = 0 to potential zero are <73 = —q at (0, yo, —zo) and <74 = +q 
at (0, —yo, — zo)- The potential and field in the region y, z >0, which contains 
the real charge, is the sum of the Coulomb potentials of q\, < 72 , < 73 , < 74 . The surface 
charge densities can be obtained with (4.1), noting that n = k for the z = 0 plane, 
and n = j for the y = 0 plane. In the regions behind and within the conductors, 
the potential and field vanish. In Fig. 4.8 the conducting planes are at an angle of 
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FIGURE 4.8 Multiple image charges. 


90 degrees. Some problems with planes at other angles can be solved in a similar 
way. 8 


■-- 

EXAMPLE 3 Figure 4.9 shows an electrostatics problem whose solution requires 
an infinite number of image charges. In this configuration there is a charge +q at 
the origin, and the two planes z = d and z = —d are grounded. 

We generate the image-charge solution by an iterative procedure. Start by 
putting an image charge — q at +2d on the z axis, in order to make V = 0 on 
the z = +d plane. But now V is not 0 on the z = —d plane, so put image charges 
— q at z = —2d, and +q at z = —4 d in order to make V = 0 on the the z = —d 
plane. But now V is not 0 on the z = +d plane, so put additional image charges 
in the region z > +d ; and so on ad infinitum. In each iteration first set the upper 
conducting plane to zero potential, and then set the lower conducting plane to zero 
potential. The final result is that in the region z > d there are image charges of 
i — \) k q at positions z = 2 dk with k = 1 , 2 , 3...; and in the region z < —d there 
are image charges of ( — \) k q at positions z = —2 dk. The images are symmetric 
about the z = 0 plane. Finally, the potential on the z = 0 plane is the sum of 
Coulomb potentials of the real charge and all the images, 


V(r ) = 


4 

4tT 6() 


1 

r 


+ £ 


k =1 


2 (— 1 )* 
•Jr 2 + 4 d 2 k 2 


(4.19) 


where r = jx 2 + y 2 is the radial distance from the origin. Generally, the poten¬ 
tial and field in the entire region — d < z < d are superpositions of contributions 


See Exercises 11, 12. 
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FIGURE 4.9 Example 3. A charge between grounded conducting planes has an infinite 
number of image charges, of which the first few are shown. 


from +q and all the image charges. Although the potential is a sum over an infi¬ 
nite number of image charges, the terms have alternating signs so convergence is 
rapid. 

If there were less symmetry, the problem would still be tractable but harder. 
Suppose, in a configuration like Fig. 4.9, the charge were closer to one plate than 
the other. We could still solve the general problem, i.e., to find the potential and 
field between the plates, by orderly placing of image charges. However, for this 
less symmetric case, the charges will not be symmetric about the z = 0 plane. 


4.3 ■ PROBLEMS WITH SPHERICAL SYMMETRY 
4.3.1 ■ Charged Spheres 

The simplest electrostatic system with spherical symmetry is a conducting sphere 
of radius a that carries a charge Qo- What are the potential and field outside and 
inside the conductor? Because of the symmetry, they have the form V = V (r) 
and E(x) = E r (r )?; i.e., V(r) and E r (r) do not depend on the polar coordinate 6 
or the azimuthal coordinate 4>. 

We can solve this problem from previous physical considerations. We know 
that all the charge must be uniformly distributed on the surface of the sphere, 
whether it is solid or a shell, i.e., a = Qo/iAna 2 ). Straightforward application of 
Gauss’s Law gives E r (r) = Qo/(Aneor 2 ) for r > a, and E r (r) = 0 forr < a. 
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The external field is the same as for a point charge at the origin, so the corre¬ 
sponding potential is V(r) = Qo/(4rreor) for r > a, and V(r) = Qo/i^neoa) 
for r < a. There is a discontinuity in E r at r = a, because of the surface charge, 
equal to er/eo, in agreement with (4.1). The potential, however, is continuous at 
r — a. We have described this system as an isolated conducting sphere with total 
charge Qo- But we could instead specify the potential of the sphere to be, say, Vo. 
This case would be equivalent to the charged isolated sphere if the charge were 
taken to be Q o = 4treo Voa. Then the potential for r > a is V(r) = Voa/r. 

We have obtained the potential function by physical arguments. A more formal 
approach would be to find the solution V(r) to Laplace’s equation, V 2 V =0, that 
satisfies the boundary conditions V (a) — Vo and V(oo) = 0. For a potential that 
depends only on r, Laplace’s equation becomes 



We may try to solve the equation with a power V(r) oc r p , and the equation 
requires p = 0 or p = — 1. The general solution is 

V(r) = — + B, (4.20) 

r 

where A and B are constants to be determined from the boundary conditions. The 
condition V(oo) = 0 requires B — 0; and the condition V(a) = Vo then requires 
A = Vo a. The potential is thus V(r) = Voa/r forr > a, as before. 


j Distribution of Charge on a Conductor 
The surface charge distribution is uniform on a spherical conductor, but finding 
cr(x) for an irregularly shaped conductor will be difficult. One useful principle is 
that charge on a conductor accumulates at those regions of the surface where the 
curvature is high. It follows that E is largest near sharp points. 

As an illustration, Fig. 4.10 shows a schematic conductor consisting of a large 
sphere of radius a connected by a long wire of length d to a small sphere of 



FIGURE 4.10 The system of two conducting spheres connected by a wire illustrates the 
fact that the charge density is greatest where the curvature is greatest, on the small sphere. 
The ratio Ob/o a is a/b. 
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radius b\ we have then a » b and d a, b. The large sphere represents a part 
of the conductor with small curvature, and the small sphere a region with large 
curvature. Because the entire conductor is at the same potential, Q a /(47T€oa) = 
£W(4treo&); from this we see that Q a = ( a/b)Qb , where Q a and Qb are the 
charges on the spheres. The ratio of the charge densities, which by (4.1) is the 
same as the ratio of the normal surface electric fields, is 

ob _ E n (b) _ Qb/b 2 _ a 
a a E n (a) Qa/a 2 b 

The phenomenon that E is largest near sharp points has important practical 
implications. The breakdown field for air is about 3 x 10 6 V/m. When the field 
reaches this value corona discharge can occur; the mechanism is that ions in the 
air are accelerated by the field and, by collisions, produce more ions and elec¬ 
trons. Corona discharges are important for xerography, ink-jet printers, field-ion 
microscopes, and other devices. 


Two Concentric Conducting Spheres; Spherical Capacitor 
It was hardly necessary to use the formal solution (4.20) for the problem of one 
conducting sphere, but it will be very useful for finding now the potential and 
field for the case of two concentric conducting spheres, shown in Fig. 4.11. Let 
the inner, solid sphere have radius a and be at potential Vo- Let the outer sphere, 
drawn as a spherical shell, have inner radius b and be grounded at potential zero. 
What is the potential V ( r ) and the field E r (r) in the region a < r < b? 

Stated as a boundary value problem, we seek the constants A and B in (4.20) 
such that V (a) = Vo and V ( b ) = 0. The algebra reduces to solving two linear 
equations in the two unknowns A and B. The solutions for the potential and field 
in the region a < r < b may be written in the form 



FIGURE 4.11 A spherical capacitor, consisting of an inner conducting sphere and 
a concentric conducting shell. The inner and outer conductors have equal but opposite 
charge. 
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V(r) = 


Voa(b - r) 
(b — a)r 


and 


Er(r) = 


Voab 

(b — a)r 2 ’ 


(4.21) 


where E r (r) = —dV/dr from E = — VV. Inside the inner conductor, r < a, the 
potential is Vo and the field is zero. For r > b the potential and field are both zero. 
It is left as an exercise 9 to sketch graphs of these functions. 

The charges on the two spheres may be obtained from (4.1) with the field given 
in (4.21). Thus, on the surface of the inner sphere a = eo E r (a) = eo Vo b/(b—a)a, 
and so the total charge is Q(a) = 4 tt€oV oab/(b — a). Similarly, on the inner 
surface of the outer sphere, a{b) = —eo E r (b), and a short calculation shows that 
the total charge is Q(b) = — (2(a). The charges are equal but opposite. 

The capacitance of the two concentric spherical conductors shown in Fig. 4.11 
may be determined from the definition (4.3). The potential difference is Vo- Using 
the charge found above, the capacitance of a spherical capactor is 


Q Aiteoab 
Vo b — a 


(4.22) 


An interesting limiting case occurs when b a. That case, i.e., the capacitance 
with respect to infinity of a spherical conductor of radius a, has C = 4;reoa. For 
example, for the Earth, whose mean radius is R = 6371 km, the capacitance with 
respect to infinity is CEarth = 4tt€oR = 7 x 1CT 4 F. The actual capacitance of the 
Earth with respect to the atmosphere is larger, about 100F [1], 


Spherical Problems with Dependence on the Polar Angle 6 
The solutions to Laplace’s equation are called harmonic functions in mathemat¬ 
ics, and the study of these functions is a large subject with a long history. There is 
an infinite set of solutions appropriate for spaces with spherical geometry, called 
spherical harmonics, and we will study the use of these general solutions in Chap¬ 
ter 5. But for now we will consider several classic systems for which the depen¬ 
dence on r and 6 is quite simple. 

We are already familiar with some solutions to Laplace’s equation of the form 
V (r, 6). In particular consider 

A C cos 0 

V{r,9) = -E B -|- - -b Dr cos 0. (4.23) 

r r l 

We know from past examples that each term in (4.23) satisfies Laplace’s equation 
at all points, except for the singularities at r = 0 in the first and third terms. 10 
Each term in (4.23) has an interesting meaning. 

(a) The term A/r corresponds to a point charge at the origin or to the external 
potential of a spherically symmetric charge distribution. The total charge on 
the system is related to A by A = Q t Ot /4jreo. 

9 See Exercise 13. 

10 The reader can verify V 2 V = 0 using polar coordinates. 
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(b) The term B is the arbitrary constant that can always be added to the potential 
without changing the field. These first two terms are the same as in (4.20). 

(c) The third term is the potential of a dipole at the origin pointing in the direction 
k, as in (3.100). The dipole moment is related to C by C = p/Aneo. 

(d) The final term, which is new, corresponds to a field E(x) = Eqk uniform 
throughout space. We may write this term as V^(r, 6) = Dr cos 6 = 
Dz = V 4 (z), because z — r cos 6. The corresponding electric field is 
E(x) = — VV 4 = — Dk. Thus the field strength is related to D by £0 = — D. 

Knowing that (4.23) satisfies Laplace’s equation in any region excluding the 
origin, for arbitrary constants A, B, C, D, we can use it to solve boundary value 
problems for which all the boundary conditions can be satisfied by this form. 11 
We now apply (4.23) to an interesting and important problem. 


EXAMPLE 4 Figure 4.12 shows a grounded conducting sphere, of radius a and 
centered at the origin, in an externally applied field E app i(x) = L’ok in the z 
direction. The presence of the sphere changes the field. What are the potential 
V(r, 6), field E(x), and surface charge density a(6)1 

We know that V(r,6) must satisfy Laplace’s equation outside the sphere, and 
(4.23) is a solution. Now we must satisfy the boundary conditions: (i) V(a, 6) = 
0, because the conductor is grounded; and (ii) V —► —Eor cos 6 as r approaches 
infinity, i.e., for r a, which we have seen is the potential of the applied field. 
The conducting sphere affects the field near the sphere, but its effect must ap- 


z 



FIGURE 4.12 Example 4. A grounded conducting sphere in an applied electric field 

E 0 k- 

11 In Chapter 5 we will develop a general method for solving Laplace’s equation for V (r, 9). Equation 
(4.23) is a special case that suffices for several interesting problems. 
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proach 0 as r —► oo; hence (ii). If we find values for A, B, C, and D, which 
satisfy these conditions, then the uniqueness theorem insures that they give the 
correct field. 

First we argue that A = 0 because the applied field acts equally, but in oppo¬ 
site directions, on positive and negative charges. Thus whatever positive charge 
appears on the sphere above the z = 0 plane, an equal amount of negative charge 
will appear below it Next we set the arbitrary constant B equal to 0. To satisfy 
condition (ii), notice that as r —> oo the term C cos 6/r 2 vanishes and leaves only 
the fourth term. Hence D = —E o. Thus far by substituting these values into (4.23) 
we have V(r, 9) = — E^r cos 9 + C cos 6/r 2 . To satisfy (i) requires C = £oa 3 . 
Finally, then, for r > a. 


V ( r , 9) = — Eor cos 9 + 


£oa 3 cos 9 


(4.24) 


Inside the sphere, V = 0. The term proportional to cos 9/r 2 is the potential of 
a dipole p = 47r«oa 3 £ok at the origin. The applied field polarizes the sphere, 
producing a surface charge distribution that creates a pure dipole field everywhere 
outside. The induced dipole moment p is proportional to the volume of the sphere. 

The components of the electric field for this problem are, from E = — W in 
spherical coordinates, 


E r (r, 9) = £o 



Eg(r,d) = —£ 0 



(4.25) 

(4.26) 


In these equations, the first terms are just the applied field. The second terms, 
which are proportional to a 3 , are the contributions to the total field from the charge 
distribution on the sphere. As usual, the surface charge density can be obtained 
from (4.1) as a(9) = eo E r (a, 9). The result, which is remarkably simple, is 


a(9) = 3eo£ocos0. (4.27) 

The total surface charge, which is given by (7tot = a(9)2na 2 sin 9d9, is 0 as 

we originally surmised. The result ot = 0 implies that no charge is transferred 
between the sphere and ground. Therefore, if in the original statement of the prob¬ 
lem we replace the grounded sphere by an uncharged isolated sphere, the same 
results (4.24)-(4.27) still hold. 

An extension of this problem is for the sphere to be isolated and given a net 
charge Qo. In that case, Q o must be distributed uniformly over the surface of the 
sphere, in order that the surface r = a remain an equipotential. The potential 
of the sphere is then Qa/{4neoa)\ a term Qo/{Aneor) is added to the external 
potential; r Qo/i^neor 2 ) is added to the external field; and Qo/(4iza 2 ) is added 
to the surface charge density. 
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FIGURE 4.13 (a) A point charge outside a conducting sphere, q is the real charge and 
q' the fictitious image charge, (b) The electric field lines and equipotentials. 


4.3.2 ■ Problems with Spherical Symmetry and External Charges 

The principal problem that we will consider involving a spherical conductor and 
an external charge is shown in Fig. 4.13(a). A grounded conducting sphere of 
radius R has its center at the origin. A point charge q is at distance zo from the 
center of the sphere; we take q to be on the polar axis at (0, 0, zo)- What are the 
potential V ( r , 6) and field E(x) for r > R1 What is the surface charge density 
a(6) on the sphere? Because of the axial symmetry, V(r, 6) and a(0) do not 
depend on the azimuthal angle <p. 

We seek a function V ( r , 6), satisfying Laplace’s equation in the region r > R, 
except for a singularity at q, and equal to zero on the surface r = R. The sources 
of this potential will be q itself as well as the charge distributed on the surface 
of the sphere. A nice and rather surprising feature of this problem is that a single 
image charge can be found which, together with q, will give the correct potential. 
If we call that image charge q' and place it at (0, 0, z' 0 ) then the potential for 
r > R is 


V(r,e) = 


1 

4;reo 


S’ 


+ 


2 + Zq - 2rzo cos 6 




+ Zq 2 - 2 rz' 0 cos 6 


. (4.28) 


Now we must determine q' and z' 0 in terms of the experimentally selected param¬ 
eters q, zo, and R. 

The image charge must lie inside the sphere, because there is only one singu¬ 
larity (q ) for r > R. Ultimately q 1 and z' 0 will give V (R, 6) = 0 for all 6. But q' 
and Zq can be determined from the simpler requirement that the potential be zero 
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at the two points A and B on Fig. 4.13(a). The condition at A is V(R, 0) = 0; by 
(4.28) this implies q(R—z'o)+q'(zo — R) = 0. The condition at B is V(R, tt) = 0; 
by (4.28) this implies q(R + Zq) + q'(R + zo) = 0. These two linear equations 
may be solved for q' and z' 0 with the results 

, — qR , R 2 

q = - and z Q = —. (4.29) 

zo zo 

Substituting the results back into (4.28), the potential function V (r, 6) may be 
written as 


V(r, 6) = 


47rf ° [y/r 2 + z 2 0 -2rzocose yjr 2 z^ + R 4 - 2rzoR 2 cos8 J 

(4.30) 

In (4.30) the first term is the contribution to the external potential due to the orig¬ 
inal charge q, and the second term is the contribution due to the surface charge 
on the sphere, which is equivalent to the potential of the fictitious image charge 
q' in the interior. It is obvious from (4.30) that V(R, 6) = 0 for all 0. Thus the 
surface of the conductor, and indeed the entire conductor, is an equipotential at 
potential zero. The uniqueness theorem assures us that (4.30) is the correct poten¬ 
tial for our problem. The two leading terms in the potential for the far-field, j.e., 
where r R and r J>> zo. correspond to a point charge q( 1 — R/zo) and a dipole 
qz o(l — /? 3 /Zg)k at the origin. 

The positions on the polar axis of the original charge and the image charge 
satisfy the relation zoZq = R 2 . These two points are called conjugate points. 
More generally, for any point (r, 6, <p) outside the sphere there is a conjugate 
point (/-', 0, (p) inside, at distance r' = R 2 / r from the center. If the exterior point 
is near the surface, then the conjugate point is approximately equidistant inside 
the surface, like the image in a plane. As the exterior point recedes from the sphere 
the conjugate point approaches the center. 

The electric field E(x) = — VV can be found from (4.30) by taking the gra¬ 
dient in spherical coordinates. In Fig. 4.13(b) are shown some field lines and 
equipotentials for this problem. In general the field has both components E r {r, 6) 
and E(t(r, 6); their calculation is left as an exercise. 12 Every point in space lies 
on some equipotential surface and the electric field vector at the point is perpen¬ 
dicular to the surface. In particular Eg = 0 at r = R, the surface of the sphere, 
in accord with the general result that there can be no component of electric field 
tangent to a conducting surface in electrostatic equilibrium. 

Another interesting quantity is the surface charge density on the conductor. It 
is obtained as usual from (4.1). In this case, o(6) = eo E r (R, 6) where E r = 
— dV/dr. The result is 


om = 


_ ± (* 2 ~ Zq ) 

4ttR ( R 2 +Zq — 2Rzocos6 ) 3 


See Exercise 15. 
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Notice that a (9) is negative (assuming q positive) for all 9. Its magnitude is 
largest at 9 =0, the point on the sphere nearest q, and smallest at 9 — n, the 
point farthest from q. The physical explanation is that a positive charge q attracts 
from ground only negative charges. The total charge on the conductor may be 
calculated by integrating (4.31) over the area, 


2 tot — 



a(9)2n R 2 sin 9d9 = — qR/zo, 


(4-32) 


which is just equal to the image charge q'. This result is obvious by applying 
Gauss’s Law to a spherical surface lying just outside the conductor, because the 
field due to the surface charge is the same as for q'. 

It is important to understand that q' is fictitious. The physical charge in the 
system consists of q and a (9). If the conductor itself could somehow be removed, 
leaving the original charge q at (0, 0, zo) and the surface charge a (9) at r = R, 
then the potential and field would be unchanged/or all r, both inside and outside 
the conducting sphere. But E outside the sphere due to a (9) is the same as if there 
were a charge q' at the conjugate point. Because it is E that is detected, a(9) and 
q' are equivalent. 

As an exercise 13 the student is asked to calculate the force on the charge q. 


An Isolated Charged Conducting Sphere with an External Point Charge 
It is natural to consider next a related problem. What would the potential and 
field be if the conducting sphere were not grounded but rather isolated with total 
charge QqI We solve this new problem, for the exterior region, by placing an¬ 
other fictitious charge Q o — q' at the origin. That is, the potential in the region 
r > R, is that in (4.30) plus the term (Qo — q')/(Aneor). The entire conductor, 
including its surface, is again at constant potential, but the potential in this case is 
(Qo - q')/(4neoR). Mathematically, it is obvious that this two-image potential 
satsifies the equations. Physically, one can understand the result by starting with 
the problem of the grounded sphere, then isolating the sphere from ground and 
adding charge to it until the net charge on it is Qo- The charge (Qo — q’) that 
must be added in this process distributes itself uniformly over the surface. But a 
uniform spherical charge has the same potential as a point charge at the origin for 
r > R. 

The surface charge density of the isolated sphere is a (6) of (4.31) plus (Qo — 
q')/(An R 2 ). For the special case of an isolated uncharged sphere the value of the 
second fictitious charge at the center is just —q'. 

Another related problem is for the conducting sphere to be held at a given 
potential Vo- The physical situation is that there is a battery connected between 
ground and the sphere in Fig. 4.13. For this situation the external potential is ob¬ 
tained by adding the term RVo/r to (4.30), because this satisfies both Laplace’s 
equation in the exterior region and the boundary condition. This result is equiva- 


13 See Exercise 15. 
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lent to placing a fictitious point charge 47reo^ Vo at the origin or, physically, to an 
additional surface charge density cq Vo//?. 


Spherical Problems with an Infinite Number of Image Charges 
Example 3 showed how to find the potential for a point charge between two 
grounded planes by the image method. Its solution requires an infinite number 
of images, but the series converges because successive images become ever more 
remote from the region of interest, between the plates. 

Similarly interesting problems involving conducting spheres can be solved 
with an infinite number of images. Three classic problems of this kind are: 

(i) To find the capacitance between a sphere and a plane. 

(ii) To find the capacitance between two spheres of arbitrary radii; the spheres 
may be outside each other or one may be inside the other and off center. 

(iii) To find the potential in space for a point charge between two conducting, 
grounded, spheres. 

In each of these cases the strategy is basically the same idea as in Example 3. 
First, put a charge at the center of one spherical conductor, which makes its sur¬ 
face an equipotential but, in general, causes the second conductor not to be an 
equipotential. (In case (iii) the first image would be put at the conjugate point of 
one of the spheres.) Then put a suitable image within the second conductor that 
makes its surface an equipotential but causes the first conductor no longer to be an 
equipotential. Then a new image charge is put within the first conductor to restore 
its surface to an equipotential. This iteration is carried through until the resultant 
series for the potential converges as close to the ultimate value as required. In 
these problems the series converges because successive charges become smaller 
by a factor of order R/d where R is the radius of one of the conductors and d is 
the separation between conductors. If R/d is reduced the convergence is faster. 
These examples are treated in detail in the books listed at the end of the chapter 
(Refs. 2-5). 


4.4 ■ PROBLEMS WITH CYLINDRICAL SYMMETRY 

4.4.1 ■ Charged Lines and Cylinders 

A charged line with constant linear charge density A is the simplest cylindrically 
symmetric system. What are its field and potential? This problem was already 
considered in Chapter 3 (Examples 2 and 6) by using Gauss’s Law, from which we 
derived the field. But we will start here by finding the potential in a more general 
way as a solution to Laplace’s equation. There are two symmetries: translation 
invariance in z, the coordinate along the line; and rotational invariance in 4>, the 
azimuthal angle. Therefore the potential can only depend on r, the perpendicular 
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distance to the line. In cylindrical coordinates Laplace’s equation becomes 


1 d_ 
r dr 



= 0 , 


which has the general solution 

V{r) = A In (r/ro) + B (4.33) 

where A and B are constants, and ro is an arbitrary scale of length (e.g., 1 m). (A 
change in r o can be absorbed into the constant B because 

In r/r' Q = In r/ro + In ro/r' 0 

and the last term is a constant.) For example, recall for a line charge with charge 
density X, 


V(r) = ^ln—, 
2neo ro 


(4.34) 


where r o is arbitrary. This result has the physically unrealistic property that it is 
infinite at r = 0 and negatively infinite at r = oo, but that is because we have 
considered an ideal line, i.e., with infinite length and infinitesimal diameter. A real 
charged wire would be finite, and (4.34) would approximate the potential near the 
wire. The divergences at r = 0 and r = oo are tolerable because the expressions 
(4.33) and (4.34) give finite potential differences, which are the only physically 
measurable properties of V ( r ) anyway. For example, the work required to take 
unit charge from r\ to r 2 is V(r 2 ) — V(r\) = (X/2neo) \n(r\/r 2 ). 

Consider next a conducting cylinder of radius a which carries the constant line 
charge density X spread uniformly on its surface. 14 For the potential in the region 
r > a we may write 


V(r) = 



(4.35) 


Here r 0 and B have been chosen so that V (a) = 0. This choice of reference point 
is different from the spherical case, because here the potential blows up as r — 00 
and the potential of the conductor is zero. This unusual situation occurs because 
the total charge on the cylinder is infinite. 


Two Concentric Conducting Cylinders; Cylindrical Capacitor 
The preceding results can be extended to find the potential between two concen¬ 
tric conducting cylinders and their capacitance per unit length. In Fig. 4.14 the 
potential of the inner cylinder, of radius a, is set at Vb; the outer cylinder, of ra¬ 
dius b, is grounded. To find V (r) for a < r < b we choose the constants A and 

14 The surface charge density a and linear charge density X of a uniformly charged cylinder are related 
by kdl = ad A where dA = 2 nadt, that is, X = 2tt aa. 
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FIGURE 4.14 A cylindrical capacitor, shown in cross section. 


B in (4.33) to satisfy the boundary conditions V(a) = Vo and V (b) = 0. Solving 
these two linear equations we find 


, „ In (b/r) J „ , x dV Vo 1 

V(r) = Vo- and E r (r) =-=-, 

In (b/a) dr In (b/a) r 


(4.36) 


In order to find the capacitance per unit length C' of this configuration, we need 
A. in terms of Vo, a, and b. This relation can be obtained by applying Gauss’s Law. 
The electric flux per unit length outward through a cylindrical Gaussian surface 
surrounding the inner conductor is 2nVo/ \n(b/a). The charge enclosed per unit 
length is, therefore, by Gauss’s Law, A. = 2neoVo/ In (b/a). Thus, for a cylindrical 
capacitor. 


A. 2neo 
Vo In (b/a)' 


(4.37) 


Potential V(r, <p) for Two Infinitely Long Line Charges +A. and —A. 

Figure 4.15 shows two infinitely long line charges with equal but opposite linear 
charge densities +A. and —A., parallel to the z axis, a distance 2d apart. We shall see 
later that the equipotentials for this system are circular cylinders, an interesting 
and useful result. For now we find the potential in plane polar coordinates (r, <p) 
defined by letting r be the perpendicular distance to the z axis, and letting cp be the 
angle around the z axis measured from the x axis, positive in the counterclockwise 
sense. (That is, (p is the azimuthal angle of cylindrical coordinates.) The potential 
is the superposition 

V(r , (p) = In r+ + —lnr_ = —ln (—^\ (4.38) 

Z7T60 Z7T60 Z7T60 \ r + / 


where r± are the distances. The firsttenn in (4.38) is the contribution of the charge 
+A., the second term is that of —A., with both in the form of (4.34). Using the law 
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FIGURE 4.15 Two infinitely long line charges. The potential is (4.38). 


of cosines to express r + and r_ in terms of (r, 0) gives 


V(r, 4>) 


A. r 2 + d 2 + 2dr cos 0 

-In —? -t-. 

4treo + d l — 2dr cos0 


(4.39) 


The potential approaches zero as r —> oo because even though each line carries 
infinite charge, the net charge on the system is zero. 

Problems with parallel lines and cylinders reduce to two dimensions with polar 
coordinates r and 0, because the system is translation invariant in z. 


Potential of a Line Dipole 

A special case of two infinite line charges is the dipole limit, where d —> 0, and 
A. —*■ oo, with 2Xd remaining finite. In that case, again referring to Fig. 4.15, we 
have a line dipole described by the vector dipole moment per unit length p' = 
2Xd i. (The direction of p' is from the negative charge to the positive charge just 
as for a point dipole.) Note that the unit of p' is the coulomb. 

To derive V (r, 0) in the line dipole limit, we rewrite (4.39) as 


V(r, 0) = 


4 ne 0 


In I 1 + 2—cos0 + 

\ r r l . 


l d d 2 ' 

In I 1—2— cos0 + -T- 

1 r 


and then expand the logarithms for d/r 1 using ln(l+.r) = x — x 2 /2+x 3 /3 + 
• • •. Only the linear terms of this expansion need be retained, so the potential of a 
line dipole is 


2 Xd cos 0 p' n cos 0 p' • ? 

V (r, 0) = v ^ V - 


2neor 


27T6Q r 2neo r 


(4.40) 
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where p' 0 = 2Xd. This result may be compared with the potential of a point 
dipole given in (3.100). Whereas for a point dipole the potential decreases with 
distance as 1/r 2 , for the line dipole the potential falls off more slowly, as 1/r; 
that’s because much more charge is on the infinite line than at a point. Both cases 
have a cosine angular dependence on the angle from the axis of the dipole. 

Cylindrical Problems with Dependence on <p 

Laplace’s equation in cylindrical coordinates has many solutions, including the 
cylindrical harmonics, which we study later. But for now we are already familiar 
with solutions of the form 

C cos d> 

V{r, (p) = A\nr + B H- \-Dr cos cp. (4.41) 

r 

This expression may be compared to the analogous spherical solution (4.23), but 
it is important to bear in mind that the coordinates are different: In (4.41), r is 
the distance from the origin in the plane, and <p is measured from the x axis. The 
first two terms in (4.41) are the same as in (4.33). The third term is the potential 
of a line dipole p' = p' i at the origin for which C = p'/lire o. The fourth term, 
which can be written Dx, is the potential of the uniform field E = Eq i parallel 
to the x axis, with D = —Eq. 


EXAMPLE 5 Figure 4.16 shows a grounded conducting cylinder, of radius a and 
centered at the origin, in an externally applied field E app (x) = i • The pres¬ 
ence of the conductor changes the field. What are the potential V (r, </>), the field 
E(r, <p), and the surface charge density o(<p)‘l 



FIGURE 4.16 Example 5. A grounded conducting cylinder in an applied field perpen¬ 
dicular to the cylinder axis. 
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This problem is the cylindrical analogue to Example 4, and we’ll solve it sim¬ 
ilarly, by evaluating the terms in (4.41). First we argue that A = 0 because there 
will be no net line charge drawn up from ground. For this case, whatever positive 
charge appears on the cylinder to the right of the x =0 plane, an equal amount 
of negative charge will appear to the left. We set the arbitrary constant B = 0. To 
match the applied field far from the cylinder we must have D = —Eo- Thus far 
we have V (r, <p) = —Eor cos <p + C cos (p/r. Finally, the boundary condition for 
the grounded cylinder is V(a, cp) = 0, which requires C = Eqo 2 . The potential 
outside the cylinder is 


V (r, <p) = -Eor cos (p + 


Eqo 1 cos <p 
r 


(4.42) 


The second term is the potential of a line dipole p' = 2neoEoa i , whose magni¬ 
tude is proportional to the cylinder’s cross-sectional area. 

The components of the electric field, obtained from E = — VV in cylindrical 
coordinates, are 


E r (r, <p) 


E<p{r, <p ) 



(4.43) 

(4.44) 


In these equations the first terms are the applied field. The second terms are the 
field contributions from the surface charge on the cylinder, which are the same as 
for a line dipole. On the surface of the conductor the tangential field component 
vanishes, i.e., E^{a, (p) = 0. (Indeed, we could have determined C from this 
condition.) The surface charge density is obtained from the normal component of 
the field by a(cp) = eo E r (a, cp), so 


o(<p) = 2eo£ocos0. (4.45) 

The total charge per unit length on the surface is zero, A to t = / 0 ~ T a (<p)ad<p = 0. 
We recall that in Example 4 there was likewise no net charge on the grounded 
sphere in an applied field. 


St. Elmo's Fire 

On ships and planes in electric storms a corona discharge, called St. Elmo’s fire, 
may be observed in the vicinity of cylindrical conductors such as spars and wet 
ropes, or wing tips and antennas. If the external field were strong enough one 
would expect the discharge along all cylindrical conductors, of whatever diame¬ 
ter, to occur at the same time. This is because the electric field at the surface of 
a cylindrical conductor is, from (4.43), E r (a, cp) = 2Eo cos <p, which doesn’t de¬ 
pend on the curvature. In this context Eo is the electric field of the storm. Although 
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Eq itself may not be large enough to cause breakdown in air, the magnitude of the 
radial field at points on the conductor with 0 = 0 and jr, which is 2£o, could be 
strong enough to cause coronal discharge. (For a sphere in an applied field, the 
field at the surface is 3Eq cos 6 r.) In reality the electrical breakdown occurs first 
at sharp ends, where the field is largest, of spars, wires, and even, in literature, 15 
a harpoon. The phenomenon had superstitious associations for sailors in the past. 


Equipotentials for Two Infinite Line Charges 

Earlier in this section we remarked that the equipotentials of two parallel line 
charges, with charge densities ±A. and separated by distance 2d, are circular cylin¬ 
ders, and that this fact can be used in electrostatic analysis. The system is the one 
shown in Fig. 4.15, for which the potential V(r, 0) is (4.39). 

To find the equipotentials of (4.39) it is convenient to use Cartesian coordi¬ 
nates, substituting x = r cos 0 and y = r sin 0, which gives 


V{x,y) 


X x 2 + y 2 + d 2 + 2 dx 

4jreo x 2 + y 2 + d 2 — 2 dx 


(4.46) 


The equation for an equipotential curve is V{x,y) = Vb; but then (4.46) im¬ 
plies a quadratic relation between coordinates x and y. In fact, the relation is the 
equation for a circle in the xy plane (i.e., a cylinder in three dimensions) 

(x - xo) 2 + y 2 = R 2 (4.47) 


where 


xo = d coth(2.T6o Tq/A) (4.48) 

R = d|csch(27reoVoA)|. (4.49) 


The circle is centered at (xo, 0) and has radius R. The center of the equipotential 
circle is in the region x > 0 if Vo and A have the same sign, and in the region 
x < 0 if they have opposite signs. The equipotential surface with Vo = 0 is the 
line x = 0 (i.e., the yz plane in three dimensions). Some equipotential curves— 
non-concentric circles—are shown in Fig. 4.15. We now apply these results to an 
interesting example. 


EXAMPLE 6 Figure 4.17 shows two conducting cylinders each of radius R whose 
centers are 2xo apart. Let the cylinder on the right be at potential +Vo and that on 
the left at — Vo. What is V (r, <p) in the region between the cylinders? What is C', 
the capacitance per unit length? 

We understand from the previous discussion that these cylinders are equipoten¬ 
tials for a system of two line charges ±A. separated by a distance 2d, as shown in 

15 Herman Melville, Moby Dick, Chapter CXIX. 
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FIGURE 4.17 Example 6. Two conducting cylinders at equal but opposite potentials. 


Fig. 4.15. Our task then reduces to finding A and d in terms of the system param¬ 
eters R, .To, and Vo- This is straightforward, although the algebra is cumbersome. 

From (4.46) and the discussion following it, we have the relations (4.48) and 
(4.49) for *o and R. To determine A, divide (4.48) by (4.49) and solve for A. The 
result is A = 27reoVo/arccosh(xo/^). Now (4.48) and (4.49) imply 

d = — R 2 = R sinh(2;reoFb/A), (4.50) 

by the identity coth 2 x — csch 2 x = 1. Substituting these results into (4.39) the 
potential is 


V(r, <t>) = 


Vo 


r 2 + Xq — R 2 + 2ryjxQ — R 2 cos (p 
2arccosh(xo /R) “ r 2 +x 2 _ R i _ 2r jx 2 - R 2 co^' 


(4.51) 


It isn’t surprising that the result is somewhat cumbersome, because it contains all 
the information needed to find the field between the cylinders and their surface 
charge densities. 

Now that we have done the hard work, it is simple to find the capacitance 
per unit length C'. We have, recalling that the potential difference between the 
cylinders is 2 Uq, 


A ne o 

2Vo arccosh(xo//?) 


(4.52) 


Example 6 was set up to treat a symmetric case of two identical cylinders. But 
the technique we used to fi nd V (r, cp) and C' can be generalized to the problem of 
two arbitrary infinite circular cylinders with parallel axes, i.e., any of the equipo- 
tential surfaces for Fig. 4.15, including cases in which one cylinder is inside the 
other and nonconcentric. 
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4.4.2 ■ Problems with Cylindrical Symmetry and an External Line Charge 

The prototype image problem for line charges and conductors is a single infi¬ 
nite line charge, with linear charge density + A, above and parallel to an infinite 
conducting plane at perpendicular distance d. Picture the plane as the surface 
boundary of a conductor that fills the half-space below it. As might be guessed by 
analogy with the problem of a point charge above a conductor, the image is a line 
charge —A, parallel to the physical line, and at distance d below the conducting 
plane. The field and potential above the plane are equivalent to those of the line 
charge and its image. The conductor is then at zero potential, and as usual there is 
no field inside it. Some implications are explored in the Exercises. 16 

There are also interesting problems involving cylindrical conductors and ex¬ 
ternal line charges that can be solved with images. The image of a line charge 
in a cylindrical conductor is in one way simpler than for the spherical case, be¬ 
cause the image of a line charge + A has linear charge density simply equal to —A. 
However, these problems can have some difficult points associated with infinite 
amounts of charge. 

Figure 4.18 shows the intersection of the xy plane and an infinite cylindrical 
conductor of radius R centered at the origin. An external line charge +A passes 
through the point (xo , 0), parallel to the z axis, which is the cylinder axis. We’ll 
consider the case of an isolated uncharged conductor. The image required to make 
the conductor surface be at constant potential is a line charge —A, also parallel to 
the z axis, which passes through the point ( x' 0 ,0), where xoXq = R 2 . That is, 
(xq, 0) is the conjugate point to (xo, 0) in the circle of radius R, the analog in two 
dimensions of the conjugate point in the spherical case treated in Sec. 4.3.2. In 
addition, a fictitious line charge + A, parallel to the other two, passes through the 
origin; this additional charge is needed to make the net line charge density of the 
cylinder zero. 


(r, <t>) 



FIGURE 4.18 A line charge outside an uncharged cylinder. 


l6 See Exercise 22. 
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Referring now to Fig. 4.18, the potential due to the three line charges is 

A 


V(r, <P) = 


A , A , 

In r+ + r-In r_ 


2ne 0 




2neo 


In r + const. 


(4.53) 


The first three terms are, in order, the contributions of the external charge, of its 
image, and of the charge at the center. If we now express the distances in terms of 
r and <p by using the law of cosines, (4.53) becomes 


V(r, <f>) = 


•In- 


r +xa 


2rx' 0 cos <p 


A r 
ln- 


47reo r 2 + Xq — 2rxo cos <p 2neo 


(4.54) 


An arbitrary constant a in (4.54), with dimensions of length, has been introduced 
to make the last logarithm dimensionless. Finally we use the conjugate relation 
Xq = R 2 /x o to eliminate x' () and combine the logs, to write 


V(r, <P) = 


4n€o 


In 


R~a 2 (R 2 + Xq r 2 /R 2 — 2rxo cos (f>) 
x^r 2 {r 2 + Xq — 2rxo cos <p) 


(4.55) 


If (4.55) is now evaluated at the surface of the cylinder, the result is V(R, <p) = 
— (A/27reo) ln(xo/cr), which is constant. By the uniqueness theorem we know that 
(4.55) is the potential for our problem. It can therefore be used to find the electric 
field outside the conductor and to find the surface charge density on the conductor. 
The constant a is still arbitrary and can be used to set the cylinder to any constant 
potential. We could, for example, choose a = xo so that the potential on the 
cylinder would be zero. However, the choice of potential on the cylinder is not 
significant. In particular, it cannot be referred meaningfully to the potential at 
infinity. The reason is that V oo as r —>■ oo, a consequence of the infinite 
amount of charge. 

The case of an infinite line charge +A in the presence of an infinite isolated 
charged conducting cylinder with linear charge density —A, is also interesting. In 
this case all flux lines from the external charge will end on the cylinder. The linear 
densities being equal but opposite implies that the asymptotic field decreases as 
\/r 2 for large r, so the field energy per unit length is finite. The potential for the 
region r > R has then only two contributions, one from the real line charge and 
the other from its image line charge —A at the conjugate position. The potential 
for this case is then 


R 2 (R 2 + x 2 r 2 /R 2 - 2rx 0 cos <p) 

_ _ v . ---— . (4.56) 

Xq (r z + Xq — 2r x 0 cos <p) 

No constant term is added, so that the potential tends to 0 as r oo. The potential 

on the conductor is finite, 


V(r, <f>) = 


47TC0 


In 


V(R,4>) = -^-ln(-). 

27TC0 \*0 / 


(4.57) 
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FIGURE 4.19 Field lines (solid curves) and equipotentials (dashed curves) for the prob¬ 
lem of a charged conducting cylinder (with dQ/dz = —A.) and an external line charge 
(with dQ/dz = +A). There is a fictitious image line inside the cylinder. 


Figure 4.19 shows electric field lines and equipotentials for this problem. 

The difficulties associated with infinite line charges are artificial in the sense 
that all real charge sources are finite. Near a long charged wire, i.e., at distances 
that are small compared to the length, the field is given accurately by the two- 
dimensional considerations with coordinates r and 4>. However, at distances that 
are comparable or larger than the length, the problem becomes three-dimensional 
and the potential may be taken to be zero at infinity. 
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EXERCISES 

Sec. 4.1. Properties of Conductors 

4.1. Suppose a conductor has a cavity inside it, and there is a point charge q somewhere 
in the cavity. Prove that the net charge on the wall of the cavity must be —q. If the 
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net charge of the conductor is Qq, what is the net charge on its outer surface? Why 
must every field line in the cavity begin on q and end on the cavity wall? (Hint: For 
the last statement, use the fact that <f E • di = 0.) 


Sec. 4.2. Charged Plates and Capacitors 

4.2. Consider the generalization of Example 1, in which there are three isolated parallel 
conducting plates whose lateral dimensions are large compared to their separations. 
Letthe total charges be Q' = 1 C, Q" = 4C,and Q"' = 7C, on the bottom, middle, 
and top plate respectively. What are the charges on each of the six faces? 

[Answer: 6,-5; 5,-1; 1,6 coulombs] 

4.3. A parallel plate capacitor has plates of area A a distance x apart. The lateral dimen¬ 
sions of the plates are much larger than x . 

(a) The capacitor is charged to potential Vo with a battery so that the plates.carry 
charges +Q and — Q. The battery is then disconnected. How much work must 
be done by an external force in order to increase the separation a small distance 
from x to x + Aj:? What is A U = f/fi n al — (/initial’ i- e -, the change in energy of 
the capacitor? 

[Answer: A U = [^£) 2 /(2eo-A)J A*] 

(b) Suppose the battery had remained connected as the external force increased the 
separation. How much work would then be done by the external force in increas¬ 
ing the separation from x to x + A*? What is A U — f/f lna i — (/initial for this 
case? Show that energy is conserved if all sources and sinks of energy, including 
the battery itself, are considered. 

[Answer: A U = —cqAVq Ax/(2x(x + A*))] 

4.4. Prove C — C\ + Cj for two capacitors in parallel, and C -1 = Cj -1 + C' 2 1 for two 
capacitors in series. 

4.5. (a) Derive the energy in a parallel plate capacitor, starting from the electric field 

energy density U£ = /2. 

(b) Derive the energy in an arbitrary capacitor (two conductors with equal but op¬ 
posite charges) by integrating the energy required to move charge dq from the 
conductor at lower potential to the conductor at higher potential. 


Image Charges 

4.6. A point charge +q is fixed at the point (0, 0, zq) above the grounded, conducting 
plane z = 0. 

(a) What is the total charge induced on the plane? 

(b) Find the radius of the circle centered at the origin in the xy plane which encloses 
half of the total charge induced on the plane. 

[Answer: R = zox/3] 

4.7. The plane z = 0 is a grounded conductor. A point charge +q is atthe point (0, 0, zq). 
(a) What i s the force o n the charge? 
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(b) What is the energy in the field? How does this compare with the energy in the 
field of two charges +q and —q at a separation of 2zo? 

[Answer: Energy in the field = — q^ / (Ibne^z^)] 

4.8. Calculate the electric field on the z axis due to the planar surface charge density in 
Eq. (4.17), by integrating the Coulomb fields of elemental charges on the surface. 
Show that the result is the same as a point charge —q at (0, 0, — zq) for field points 
in the region z > 0; and the same as a point charge —q at (0, 0, zq) for field points 
in the region z < 0. Why does this justify the image charge method? 

4.9. The planes y = 0 and z = 0 are grounded conducting surfaces. A point charge q is 
at (0 , a, a). Show that in the x = 0 plane, in the quadrant y, z > 0, the leading term 
in the potential for r 3 > a is 

3 qa^ 

V (r, 9) = - 7 sin G cos 9 

where r is the radial distance from the origin and 9 is measured clockwise from the 
z axis. (Hints: First convince yourself that the monopole and dipole terms are zero 
for the charge configuration. Then, in the sum of the four inverse distances retain 
only terms of order a 2 /r 3 for r s> a. Take full advantage of the symmetry.) 

4.10. (a) Consider a variation of Example 2, in which the z = 0 plane is grounded and 

there is a dipole p = pqk at (0, 0, zq). What are the leading terms in the potential 
and electric field, on the z axis for z 3> zo? 

(b) Now consider the general case in which the z = 0 plane is grounded and there 
is a dipole p = po(j sin 9q + k cos 0 q) at (0, 0, zq). What are the leading terms 
in the potential and in each component of the electric field, on the z axis for 
z » z 0 ? Confirm that your result reduces to that of part (a) for 9q = 0 and to 
that of Example 2 for 9q = jr/2. 

[Answer (b): 

£ z (0, 0, z) = PoCOsGo/infQZ 3 ) and £j,(0, 0, z) = -3p 0 zo sin0o/C27re o z 4 ) ] 

4.11. Two grounded conducting planes meet at an angle 9 = 60 degrees at the origin. A 
point charge q is a distance ro from the origin along their angular bisector. What 
image charges are needed to satisfy the boundary conditions? What is the r depen¬ 
dence of the potential, on the angular bisector, for r ro? (Hint: Choose the most 
symmetric arrangement, with the real charge on the horizontal axis.) 

[Answer: The leading term is V(r) — \Sqr^/ 

4.12. TWo grounded conducting planes meet at an angle 9 at the origin, and a point charge 
q is a distance ro from the origin along their angular bisector. (See Exercise 11.) 
Show that if 9 = 2n/n, where n is any even integer, then n — 1 image charges 
are required to ground both sides of the wedge. Show that if, however, n is an odd 
integer then an image solution cannot be obtained. (Hint: Consider the case n = 3.) 

This problem raises an interesting question about the connection between physics 
and mathematics. For the special cases where the angle between the plates is 2n/n 
with n even, it is not difficult to find the potential and field, exactly, by analytical 
mathematics. But for all other angles the mathematics is too hardfor us. Why should 
this be? In nature there is nothing special about any angle, because in setting up an 
experiment we can set the plates as easily at one angle as at another. 
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z 

■ q 


-z = 0 



FIGURE 4.20 Exercise 16. A point charge above a grounded plane with a hemispherical 
boss. 


Sec. 4.3. Spherical Symmetry 

4.13. Sketch graphs of the potential V(r) and electric field E r (r) for two concentric 
spheres held at potentials Vo an d 0, respectively. 

4.14. Two concentric conducting spherical shells with radii a and b have charge +Q and 
— Q respectively. Calculate the total field energy U by integrating «q£ 2 /2 over all 
space. From the result, what is the capacitance? 

4.15. A grounded conducting sphere of radius R has its center at the origin. Outside the 
sphere, at the point (0, 0, zo), is a point charge +q. 

(a) What is the force on the charge? 

(b) Plot the force as a function of zq/R using a computer program. 

(c) What is the potential energy as a function of zo? 

[Answer: U = —q^R/[8neo(zQ — /? 2 )].] (Hint: The potential energy is the external 
work done in bringing the charge from infinity to z = zo ) 

(d) What are the field components E r (r , 6) and Eg(r, 6)1 

4.16. Figure 4.20 shows the grounded conducting plane z = 0, and on it a grounded 
conducting hemispherical boss of radius Rq whose center is at the origin. A charge 
+q is at the point (0, 0, zo). 

(a) What is V(r, 6 )? (Hint: There will be three image charges.) 

(b) What fraction of the total induced charge is induced on the hemispherical boss? 

[Answer: 1 - (zg - Rq)/(zo^zI + r|)] 

4.17. (a) Suppose that the inner conductor in Fig. 4.11 is a spherical shell instead of a 

solid sphere, and that both it and the outer spherical shell are isolated from each 
other and from ground. If charges Q' and Q" are put on the inner and outer 
conductors, respectively, what will the charge be on each of the four spherical 
interfaces at equilibrium? 

(b) Consider now the case of three concentric spherical, isolated, conducting shells. 
Charges Q', Q", and Q"' are put on the innermost, intermediate, and outer con¬ 
ductors, respectively. What is the charge on each of the six spherical interfaces 
at equilibrium? 



130 


Chapter 4 Electrostatics and Conductors 


[Answer: From innermost to outermost the charges are: 0, Q'\ — Q', Q'+Q"\ — Q f — 
Q", Q' + Q" + Q'" ,\ Notice how this distribution compares with that in Exercise 
4.2, the analogous planar problem. 

4.18. Any two conductors can be considered to be a capacitor. Suppose one conductor is a 
sphere of radius a and the other is a sphere of radius b. The centers of the conductors 
are a distance d apart, where d 3> a, b. What is the capacitance? 

[Answer: C = 4jt€Qab/(a +6).] 

Notice that this interesting result is independent of the separation. The underlying 
physics is that the potential of each conductor is dominated by its own charge and 
shape, and only weakly dependent on the other conductor. This means that C for any 
two conductors with separation large compared to their dimensions will depend on 
d only weakly. 

4.19. An arbitrarily shaped grounded conductor has within it a spherical cavity of radius 
a. Define the origin to be the center of the cavity. There is a point charge q in the 
cavity at distance b from the center. Define the z axis to be the direction from the 
origin to q. 

(a) Use the method of images to find the potential V ( r ) in the cavity. 

(b) Determine the charge density a (9) on the cavity wall. 

(c) Evaluate Jq a (9)2 jt a 2 sin Odd, the total charge induced on the wall. Explain 
the result by Gauss’s Law. 

[Answer (b): 

„(« = — , ~° 2) —„] 

4 7ta(a 2 + b 2 — 2ai>cos0)3/ 2 

4.20. An isolated spherical conductor of radius a and with charge Qq is placed in a uni¬ 
form electric field whose magnitude is Eq. 

(a) What must Qq be in order that a (9) > 0 everywhere on the surface of the 
sphere? 

(b) Sketch a graph of the function a (0). 

4.21. A point charge q is at a distance zo from the center of an isolated spherical conductor 
of radius R. The charge q is outside the sphere (zo > R) and the conducting sphere 
has charge Qq on it. 

(a) What must Qq be in order that a(9) > 0 everywhere on the surface of the 
sphere? 

(b) Plot the resulting a (9) using a computer program. For the plot assume zo = 3 R. 
Sec. 4.4. Cylindrical Symmetry 

4.22. Consider y = 0 to be a grounded conducting plane. There is an infinitely long line 
charge above it. The line charge is parallel to the z axis, intersects the xy plane at 
the point (0, yo) and has constant linear charge density +X. 

(a) Find the surface charge density a(x, z) on the y = 0 plane. 

(b) Show that the induced charge per unit length on the plane is —X. 
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FIGURE 4.21 Exercise 23. A cylindrical cavity in a conductor, with a line charge X at 

W, 0). 


4.23. A cylindrical conductor with arbitrary cross-sectional shape has i n it an infinitely 
long cavity with circular cross-section, radius R, centered at the origin, as shown in 
Fig. 4.21. The axis of the conductor and cavity is the z axis. An infinite line charge 
+A inside the cavity is parallel to the z axis and passes through the point (d, 0). 

(a) Find an expression for the potential V ( r , 9) inside the cavity, in terms of plane 
polar coordinates. (V (r, 9) is determined up to an additive constant.) 

(b) What is the charge density a (9) on the wall of the cavity? 

[Answer (b): 

X(d 2 - R 2 ) 

o{9) =-=-=-] 

2nR(R 2 +d 2 -2dRcos9) 

(c) What is the total charge per unit length induced on the wall? 

(d) What is the force per unit length on the line charge? 

General Exercises 

4.24. A point charge q (with q > 0) is located at distance 2a from the center of a grounded 
conducting sphere of radius a; let the center of the sphere be the origin and the 
position of q be on the z axis at (0, 0, 2a). What is the electric field at (0, 0, —2a)? 
I s i t directed toward o r away from q ? 

4.25. An isolated conductor, in the shape of an ellipsoid of revolution with foci on the z 
axis at z = ±f, has total charge Q. 

(a) What is the potential function V(x)2 Use cylindrical coordinates (r, <p, z) and 
the results from Chapter 3, Example 10. By the uniqueness theorem if you find 
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a solution of Laplace’s equation that satisfies the boundary conditions, it is the 
potential. 

(b) Let c be one-half the length of the major axis of the ellipse, and a one-half the 
length of the minor axis, so that the equation for the ellipse in the xz plane is 
(x/a) 2 + (z/c) 2 = 1. Also, the eccentricity is £ = f/c, and a = cyl—e 2 . 
What is the potential Vo of the conductor, in terms of c and £? 

[Answer: Vq = (g/87reo £C ) MO + £ )/0 - £)}] 

(c) Determine the surface charge density at (x, y, z) = (0, 0, c) and at (x, y, z) = 
(a, 0, 0). Express the results in terms of c and £. Note that for a needle (e ~ 1) 
the surface charge at the end is much greater than at the middle. 

Computer Exercises 

4.26. Consider a charge q halfway between two grounded conducting plates. The electric 
field can be calculated by the image method, with an infinite number of images. On 
the midplane the field points away from q, by symmetry. Use a computer program 
to plot the contribution to the electric field from the image charges (i.e., the full 
field minus the Coulomb fi eld of the real charge) on the midplane, as a function of 
distance from q. At what points on the midplane is the field due to the conducting 
planes maximum in magnitude? 

4.27. The region z £ 0 is a conductor. On the positive z axis there is a point charge +Q 
at z = (■ and a point charge — Q at z = 21. 

(a) Sketch a picture of the electric field lines. 

(b) Use a computer to make a log-log plot of E z (0, 0, z) as a function of z, for 
z > 2 1, and from the slope determine the asymptotic dependence on z. 

(c) Use a computer to plot E z (x, 0, 0) versus x. On the xy plane, where is E = 0? 



CHAPTER 



General Methods for 
Laplace's Equation 


“The miracle of the appropriateness of the language of mathematics for 
the formulation of the laws of physics is a wondetful gift which we neither 

understand nor deserve.” 

- E. P. Wigner 


“I never came across one of Laplace’s ‘Thus it plainly appears’ without 
feeling sure that I have hours of hard work before me to fill up the chasm 

and find out how it plainly appears.” 

- Nathaniel Bowditch, American mathematician and astronomer 


The most general problem in electrostatics is to determine the electric field E(x) 
for any static system of charges in the presence of solids, liquids, and gases. This 
kind of problem can be formidable; it frequently cannot be solved analytically. So 
far in our study we have limited ourselves to considering static systems of charges 
in the presence of conductors. Even those problems can be difficult, as we have 
seen. The approach used in physics to leant about a difficult general problem is to 
solve selected simple problems, which in electromagnetism means problems with 
a lot of symmetry. This chapter is another step for us in that approach. 

We learned in Chapters 3 and 4 that the general problem can be reduced to 
finding a scalar function—the potential V (x)—which is usually easier to calcu¬ 
late than the vector function E(x). The field, charge density, and potential are 
connected by 


E(x) = -W and p(x)=e 0 V-E. (5.1) 

The first of these equations permits us to find the field from the potential, and the 
second relates the field to the sources of charge. 

Of particular interest to us in this chapter is the potential function in the free 
space of the system. Free space means the space outside the sources of charge. 
Those sources may be isolated charges, or charged surfaces, or charged volumes. 
But because p(\) =0 in free space, V (x) satisfies Laplace’s equation 

V 2 V = 0. (5.2) 
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An important class of electrostatics problems are those in which the potential 
or charge distribution is given on conducting surfaces. In these problems we seek 
the solution to (5.2) in the region bounded by the surfaces, i.e., in the free space 
outside the conductor. Such problems are collectively called boundary value pr ob¬ 
lems. We picture the boundary of the free space as a set of one or more surfaces 
at which V (x) has a known behavior. On a conducting surface V (x) is constant, 
independent of position x on the surface. At a charged conducting surface the 
normal (n) component of the electric field is a/e o, where a is the surface charge 
density. The general procedure to find V (x) in the free space, i.e., to solve the 
boundary value problem, is to construct the solution of the partial differential 
equation (5.2) with known values of V(x) or n • VV on the boundary surfaces. 
Specifying V(x) on a surface is called a Dirichlet boundary condition, and speci¬ 
fying n • VV is called a Neumann boundary condition. 

In Chapter 4 we studied a variety of solutions to Laplace’s equation. However, 
all of those examples concerned highly symmetric systems. The solutions were 
generated by mathematical tricks, such as the method of images, or by writing 
the solution as a linear combination of a few known solutions and picking the 
coefficients to satisfy the boundary conditions. The purpose of the present chap¬ 
ter is to examine three general methods for constructing solutions to Laplace’s 
equation. The method of separation of variables (Secs. 5.1-5.3) can be applied 
to problems with rectangular, spherical, or cylindrical boundaries; this material is 
the main subject of the chapter. The method of conjugate functions (Sec. 5.4) ap¬ 
plies to two-dimensional problems. The method of iterative relaxation (Sec. 5.5) 
is a numerical method. 

If we find a function V (x) that satisfies Laplace’s equation and the boundary 
conditions of a system, then V (x) is the unique solution of the boundary value 
problem. No matter how we found the function—whether by one of the tricks 
in Chapter 4, by one of the classical techniques of the current chapter, or by a 
lucky guess—the problem is solved. General methods are important because they 
provide approaches to complicated cases. But solving Laplace’s equation is never 
routine, and most solutions are found by a combination of educated guess-work 
and classical analytic techniques. 

The Uniqueness Theorem 

To justify the first statement in the previous paragraph, we shall prove the unique¬ 
ness theorem for Laplace’s equation. Suppose Vi (x) and lAjx) are both functions 
that satisfy Laplace’s equation (5.2) in a volume V of space, and that satisfy the 
same boundary conditions on the set of boundary surfaces S of the volume. To 
prove the uniqueness theorem we must show that Vj(x) = V 2 (x); i.e., there is 
only one solution. 

Let F(x) be the function 

F(x) = (V, - V 2 ) V (Vi - V 2 ), (5.3) 

and consider the integral of V • F over the volume V of interest. By Gauss’s theo- 
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rem, 


[ V-Fd 3 x = f (V) - V 2 )n- V(Vi - V 2 )dA. (5.4) 

Jv Js 

Again, S is the set of boundary surfaces, which may include surfaces at infinity. 
But both V\ and V 2 satisfy the same boundary conditions on S', so on S, either 
V\ = V 2 for the case of a Dirichlet boundary condition, or n • V V\ = n • V V 2 for 
the case of a Neumann boundary condition. In either case the right-hand side of 
(5.4) is 0. 

Now, F(x) has the form /G, where / is the scalar V\ — V 2 and G is the vector 
V(Vi — V 2 ). Using the identity V • (/G) = V/ • G + /V • G, the divergence of 
F is 


V F = V(Vi - V 2 ) • V(Vi - V 2 ) + (Vi - V 2 ) V 2 ( V\ - V 2 ). (5.5) 

The final term is 0 because both V\ and V 2 satisfy Laplace’s equation. Thus (5.4) 
becomes 


/ mVi-V 2 )] 2 d 3 x= 0. (5.6) 

Jv 

The integrand is never negative, so the only way the integral can be 0 is if V(Vi — 
y 2 ) = 0 for all x; that is, V\ (x) — V 2 (x) must be a constant C independent of 
x. If there is any Dirichlet boundary condition in the system then C must be 0 
because V\ (x) and V 2 (x) would have the same value at the surface with a Dirichlet 
boundary condition. Hence, 


Vi(x) = V 2 (x), (5.7) 

and the uniqueness theorem is proven. If there are only Neumann boundary con¬ 
ditions (an uncommon case), then V\ (x) and V 2 (x) may differ by an arbitrary 
constant, but the eleclric field E(x) is the same for V\ (x) or V 2 (x). 


5.1 ■ SEPARATION OF VARIABLES FOR CARTESIAN COORDINATES 

An important general method for boundary value problems is separation of vari¬ 
ables. The best way to leam the method is to apply it to some example problems, 
which we’ll do below. But a general overview is also useful. 

The first step in this method is to construct all solutions to Laplace’s equation 
that have the form of a product of separate functions for each of the three co¬ 
ordinates. Then the solution of any boundary value problem can be written as a 
superposition of separable functions. The determination of V (x) reduces to find¬ 
ing the coefficients of the superposition such that all the boundary conditions are 
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satisfied. The method may be applied to Cartesian, cylindrical, or spherical coor¬ 
dinates, whichever choice is best suited to the system of interest. 1 

To analyze an electrostatic system with rectangular boundaries, it is natural to 
work in Cartesian coordinates x, y, z. 


5.1.1 ■ Separable Solutions for Cartesian Coordinates 

First, construct the solutions to V 2 V = 0 that have the separable form 

V(x, y, z) = X(x)Y(y)Z(z). (5.8) 


Not all solutions have this form, but we can use linear combinations of these sepa¬ 
rable solutions to solve any boundary value problem with rectangular boundaries. 
For (5.8) Laplace’s equation becomes 


d 2 X 

dx 2 


d 2 Y d 2 Z 

YZ + — XZ + — t XY 
dy z dz l 


0 ; 


or, dividing by X Y Z, 


1 d 2 X 1 d 2 Y 1 d 2 Z 
X dx 2 Y dy 2 Z dz 2 


(5.9) 


(5.10) 


The first term depends only on x, the second on y, and the third on z. But these 
variables are independent, so the only way these three functions can sum to 0, for 
all values of x, y, and z, is if each term separately is a constant, and the three 
constants sum to 0. That is, Laplace’s equation demands 


1 d 2 X 2 1 d 2 Y 2 1 d2z 2 

Xtf _ " 1 ’ YTy 2-*2’ zd? - * 3 ’ 


(5.11) 


where 


K 2 + Kj + K 2 = 0. (5.12) 

We denote the constants by k 2 ,k^, /c| because then X(x), Y (y), Z(z) have a con¬ 
venient form. The partial differential equation V 2 V = 0 has separated into three 
ordinary differential equations. The parameters k\ , /q, kt, may be real or imagi¬ 
nary, but they cannot all be real, nor all imaginary, because then the sum of their 
squares would not be 0 as required by (5.12). At least one of them must be real 
and at least one must be imaginary. We will soon leam that the choice of real or 
imaginary k, is forced by the boundary conditions in an interesting way. 

A special case of solutions to (5.11) is for k\, /q, kt, all equal to 0. In that 
case the solutions are X(x) = ax + b, Y (y) = cy + d, Z(z.) = ez + /, where 

’Separation of variables can be applied to other coordinate systems, such as ellipsoidal or toroidal 
coordinates. These are appropriate for boundary value problems where the boundary is a surface on 
which one of the coordinates is constant. 
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a ... f are constants. Combining these factors in (5.8) we get the special solution 
to Laplace’s equation 

V(x, y, z ) = (ax + b)(cy + d)(ez + /), (5.13) 

which is used if there is a uniform field or in other special cases. 2 

The general solution of the differential equation in (5.11) for X(x) is a linear 
combination of exponentials e K[X and e~ K>x . The functions Y (v) and Z(z) are 
analogous. Thus 


X(x) = C,e k ' x + Die- KIX (5.14) 

Y(y) = C 2 e Kiy + D 2 e~ K2y (5.15) 

Z(z) = C 2 e K3Z + D 2 e~ K3Z (5.16) 

where the C’s and D's are constants. These functions deserve some discussion. If 
k\ is real, then as an alternative to (5.14) we may write X(x) = C\ cosh k\x + 
D' x sinh k\x. The choice of which expression to use is determined by the geometry 
of the system. For example, if x = +oo is included in the region where V (x) is 
sought, then use X (x) = D\e~ K,x , because any of the other candidate functions is 
infinite there. Similarly, if* = — oo is included in the region, use X(x) = C\e K]X . 
On the other hand, if the region of interest is bounded in x, then use a linear 
combination of cosh and sinh, which are even and odd functions of x, respectively. 
If from the symmetry of the system it can be seen that the potential is even in x, 
then use only X(x) = C\ cosh/qx. Or, if the potential is odd in x, then use 
X(x) = D\ sinh/qx. Use similar considerations for Y(y) and Z(z). 

If k\ is imaginary, which is the case for k 2 negative, then we may alternatively 
write (5.14) as X(x) = C " cosk\x + D" sin&i* where k\ = |/q |. This form has 
the advantage that it is manifestly real, and separated into functions with definite 
parity: cos k\x is even in x and sin k\x is odd. If from the symmetry of the problem 
the potential is even in x, then use only X(x) = C” cosfcix, and if the potential 
must be odd, then use only X(x) = D" sin/qx. 

We have generated within the separable form (5.8) a large class of solutions 
to Laplace’s equation. Furthermore, any linear combination of these solutions is 
also a solution, because Laplace’s equation is linear. For a given boundary value 
problem we must find the linear combination that satisfies the boundary condi¬ 
tions. It is always possible to find such a linear combination, by the completeness 
theorem, which states that any solution of Laplace’s equation can be expanded in 
functions of the form (5.8). Of course, as a practical matter, we would only use 
this class of separable solutions for problems with rectangular boundaries. 

We now apply this technique for solving boundary value problems to some 
specific examples. 

2 See Exercise 1. 
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5.1.2 ■ Examples 


EXAMPLE 1 Consider a square conducting pipe. The length L is much larger 
than the dimensions of the cross section, which is a square of size a x a. The 
walls are insulated from each other. Two opposite walls are grounded (V = 0) 
and the other two are held at potential Vo. What is the potential function inside 
the pipe? 

Set up coordinates as shown in Fig. 5.1. The z axis is the central axis of the 
pipe. The grounded walls are those at x = ±a /2. Because the pipe is very long we 
may consider the potential to be independent of z, so V = V(x, y). A separable 
solution for this effectively two-dimensional problem is 

X(x)Y(y ) = cosf&i*) cosh(K 2 .v) (5.17) 

with — k\ + = 0. Recall from the general discussion that one of the functions 
X ( x ) and Y (y) must be sinusoidal and the other hyperbolic. Note that X (x) must 
be the sinusoidal one, to satisfy the boundary conditions V(±a/2, y ) = 0; those 
conditions applied to the separable solution X ( x ) Y (y) become 

X(a/2) = X(—a/2) = 0. (5.18) 

A hyperbolic function (sinh or cosh) cannot have two zeroes, 3 so X(x) must be 
sinusoidal. Then Y (y) is hyperbolic, and it must be the even function cosh Kiy 


y 



a/2 


-a/2 

1 


a/2 






—a/2 

\ 



Vo 



FIGURE 5.1 Example 1. A square conducting pipe shown in cross section. The axis of 
the pipe is the z axis, pointing out of the page. The walls at x = ±a /2 are grounded, and 
the walls at y = ±a/2 are held at the potential Vo- 

3 The definitions are cosh* = ( e x + e~ x )/2 and sinh* = (e x — e~ x )/2. Note that cosh* is nowhere 
0 and sinh * is 0 only at * = 0 (cf. Refs. 5-7). 
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by the symmetry of the system, because the system is invariant under the reflec¬ 
tion y —*■ —y. The boundary condition (5.18) requires that k\a is an odd integer 
multiple of n. 


cos(&ia/2) = 0 implies k\={2n + \) — \ (5.19) 

a 

here n is a nonnegative integer. Also, for (5.17) to be a solution of Laplace’s 
equation it is necessary that K 2 = k\. 

The general solution of Laplace’s equation f or V (x , y) in the square pipe, with 
V = 0 on the walls at x = ±a/2 and V (x, y) even in y, is a linear combination 
of the separable solutions (5.17) 

00 

V (x, y) = cos |^(2n + 1)-— J cosh |^(2n + 1) — J . (5.20) 

n= 0 a a 

Now we must determine the coefficients C„ to satisfy the other boundary condi¬ 
tions 


V(x, a/2 ) = V(x, -a/2) = V 0 , (5.21) 


which implies 


Vo = cos |^(2n + 1)—J cosh |^(2n + 1)—j . (5.22) 

n=0 ° 

Equation (5.22) is quite remarkable: A sum of oscillatory functions (of x) on the 
right-hand side is a constant. What are the coefficients C„ that produce this result? 
Finding the C n ’s is a problem in Fourier analysis. 


Comments on Fourier Series 

If f(x) is a periodic function with period 2 a then it can be expanded in sines and 
cosines in the form 


OO 

f (x) = ao +^2 a k cos 

k=\ 


/ knx\ ~ , 


sin 


k =l 




(5.23) 


The right-hand side is called the Fourier series or Fourier expansion of f(x). 
(Note that cos (knx/a) and sin(&7rx/a) are periodic in x, with period 2a, if k is 
an integer.) The set of sines and cosines are called orthogonal functions, and they 
satisfy the orthogonality relations 



jnx knx 

sin-sin- ax = adjk, 

a a 
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jnx knx 

cos-cos -- ax = aS 

a a 


jnx knx 

sin --cos- ax = 0. 

a a 


jk > 


(5.24) 


Here j and k are positive integers, and Sjk is the Kronecker delta. The proof 
of (5.24) is a straightforward exercise. 1 * * 4 If we multiply f(x) by cos (jnx/a) or 
sin(jnx/a), and integrate over x, then only the cosine or sine term with k = 
j in the expansion (5.23) is nonzero; so we pick out the coefficient aj or bj, 
respectively, by this procedure: 

a, = — [ f(x) cos ^- dx, (for/^ 0) (5.25) 

a J-a a 

bj = — [ f(x)sin^-^-dx. (5.26) 

a J-a a 


The case j = 0 is special: 


a o 


1 r 

2 a J^a 


f (x)dx. 


(5.27) 


As an example, let f(x) be the square wave S(x) plotted in Fig. 5.2. The 
function S(x) is periodic in x with period 2a. Because S(x) is an even function 
the sine components in (5.23) are 0; that is, bk = 0. Also, a o is 0 because the 
average of S(x ) over a cycle is 0. The Fourier coefficient ak is 


1 f a knx 

ak = — S(x) cos-- dx 

a J-a a 

4 (kn \ 

kn V 2 / 


Six) 



FIGURE 5.2 Square wave. S(x) alternates between +1 and —1 with period 2a, and is 
+ 1 in the domain (—a/2, a/2). 

4 See Exercise 2. 
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_ 4 f(—1)” for k = 2n + 1 (odd) 

kn \0 for k = 2n (even). 

Thus the Fourier series for the square wave S(x) is 


4 ^ (-1)" 

S(x) = ~ - x . , n cos 

77 ^ (2n + 1) 


r jtx~\ 

[(2n+l) —J. (5.29) 


In the region —a/2 < x < a/2 the square wave S(x ) is 1. For this domain 
equation (5.29) is like our earlier equation (5.22): a sum of cosines adding up to a 
constant. Of course the x domain (—a /2, a /2) in our electrostatics problem is not 
one-half cycle of a periodic structure, as it would be for the square wave S(x). But 
this difference doesn’t matter: In (5.29) we have a linear combination of cosines 
that is constant in the domain (—a/2, a/2). Comparing (5.22) and (5.29) we see 
that the coefficient in the former must be 

^ 4V 0 (-1)" 


C,cosh[<2„ + l)|] = - 2 —, 


which determines C n . Finally then the potential V (x , y ) in the square pipe of Fig. 
5.1 is 


* ±o 2 " + l 


1 r x x ~\ 

-cos (2n+ 1) — 

1 L a J 


nx -| cosh [(2n + 1 )yry/a] 
a J cosh[(2n + l)7r/2] 



FIGURE 5.3 Equipotentials i n the square conducting pipe o f Example 1 . At the top and 
bottom V — Vq and at the sides V = 0. The potential values are shown in units of Vq. 
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FIGURE 5.4 Surface plot of the potential function V(x, y) in (5.31). The potential 
values are shown in units of Vq. 


We have the solution to the boundary value problem in the form of an infinite 
series. 

Figure 5.3 shows the equipotential curves of the potential V(x, y ). Figure 5.4 
illustrates the potential in another way, as a three-dimensional surface plot of the 
equation h = V(x, y). The surface has a saddle shape. Figure 5.5 has graphs of 
V(x, 0) versus x, and V (0, y) versus y, demonstrating again the saddle shape. In 



FIGURE 5.5 Plots of the potential function V (x, y) in (5.31). (a) V(x, 0) versus x, and 
(b) V(0, y) versus y. 
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general, the solution of Laplace’s equation V 2 V = 0 is the function V (x) that 
minimizes the integral of (VV ) 2 over the bounded space . 5 In this example the 
space is the two-dimensional square. The saddle shape minimizes the integral of 
(VV) 2 , subject to the boundary conditions. To make V V small, V makes itself as 
flat as possible with the boundary values. 

We have a solution, but only in the form of an infinite series. To evaluate the 
potential V(x,y) or field E = — VV at a specific point, with the exception of 
points on the diagonal , 6 a computer must be used. For example, Figs. 5.3-5.5 
were made by a computer. Or, the field at the point (0, a/4) is E y (0,a/4) = 
— dV/dy = —1.07432 Vo/a, where a computer was used to get the final number. 
The potential at the origin is Vq/ 2 by symmetry . 6 


EXAMPLE 2 A conducting pipe with specified end potential. In this example a 
rectangular region is bounded on five sides. Figure 5.6 shows a long metal pipe 
with an end cap at x = 0. The cross section is a rectangle of size £2 x £ 3 . The 
interior, where V 2 V = 0, is the space 

0 < x < 00, 0 < y < £ 2 , 0 < z < £ 3 . (5.32) 

The four sides (the bottom and top surfaces at y = 0 and y = £2 and the side 
walls at z = 0 and z = £ 3 ) are assumed to be grounded, so that V — 0. The 


y 



FIGURE 5.6 Example 2. A rectangular conducting pipe. The bottom and top surfaces 
(y = 0 and y = £ 2 ) and the side walls (z = 0 and z = £ 3 ) are grounded. 


5 See Exercise 24. 

6 See Exercise 27. Reflection symmetry about the diagonal implies that V is Vo/2 on the diagonals. A 
remarkable infinite series is revealed by this result. 
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end at x = 0 has some specified potential Vo(y, z), which for the moment we 
leave arbitrary. The problem is to find V(x , y, z)- This is an example of Dirichlet 
boundary conditions, because V(x, y, z) is specified on the surfaces. Unlike the 
previous example, which reduced to a two-dimensional problem, this problem 
will involve all three coordinates. 

The separable solutions are again X (x)Y ( y)Z(z ), where the separate functions 
are exponential or sinusoidal. They must satisfy the relevant boundary conditions. 
To have V = 0 on the grounded sides we require Y (0) = Y (i 2 ) =0 and Z(0) = 
Z(^ 3 ) = 0. A combination of real exponentials cannot have more than one zero, 
so Y(y) and Z(z) must be sinusoidal, with nodes at the boundaries; that is, 

Y (y) = sin and Z(z) = sin( / ^) (5.33) 

where n and rn are integers > 1. The constants Y"/Y and Z"/Z are 

*2 = —(nn/ 12) 2 and /tf = -{mn/i-i) 2 . (5.34) 

To satisfy Laplace’s equation X (x) must be a real exponential; to keep the poten¬ 
tial finite as x —» 00 it must be the decreasing exponential 


X(x) = C nm e~ w 


(5.35) 


where the decay constant is 


The full potential of the system in Fig. 5.6 may be written as a superposition of 
these separable solutions, 


V(x, y, z) = C nm e KnmX sin (^p^) sin (~T^) ■ ( 5 - 37 ) 

n =1 m= 1 V / V r3 / 

The remaining task is to determine the constants C nm . 

The final boundary condition, at the end at x = 0, is 

VoC)', z) = ^ C « mSin (“^) sin (“^~)- ( 5 - 38) 


Equation (5.38) is a double Fourier series, in y and z. According to the com¬ 
pleteness theorem of Fourier series, any function in a bounded domain can be ex¬ 
panded in trigonometric functions. The theorem guarantees that coefficients C nm 
exist such that (5.38) is satisfied. The orthogonality relation of sine functions in 
this case is 
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where j and n are positive integers and S jn is the Kronecker delta . 7 It is im¬ 
portant to appreciate (5.39) at an intuitive level. The functions sin (jny/ti) and 
sm(nny /ti) oscillate about 0 as functions of y. If j = n they oscillate in phase, 
so the product is always > 0 and the integral is nonzero, in fact ii/l. If j / n 
they are in phase at some positions and out of phase at others, so the product oscil¬ 
lates about 0 and the integral is 0. The orthogonality relation is used to calculate 
the coefficients C nm . Orthogonality is applied to both variables y and z. Multiply 
both sides of (5.38) by (jl/ij) sin(yjry/f 2 )( 2 /f 3 ) sm(knz/(■?,), and integrate over 
y and z. This projects out on the right-hand side the term in (5.38) with nm = jk, 
and hence determines the coefficient Cy* to be 

c>t =ii rc wy - z> sin ijf) ] s,n ) dydz - (5 - 4o) 

This holds for all j and k, so replacing jk by nm gives the coefficient in (5.37). We 
have solved the problem: Given Vo(y, z) the Fourier coefficients are calculated by 
(5.40); and then V ( x , y, z) is given by (5.37). In practice the solution may require 
heavy numerical computation, e.g., to compute the values of the coefficients or 
the field E(x, y, z) at a specific point of interest, because V (x) is expressed as an 
infinite series. With a computer one can evaluate many terms in the series, but it 
will be necessary to truncate the series, i.e., drop the terms after some maximum 
n and m, and the truncation produces some numerical error. 

As a special case, suppose the end potential is just a constant Vo- Then the 
Fourier coefficients obtained from (5.40) are , 8 for n and m both odd integers, 

16Vo 

C nm = - 5 -^. (5.41) 

n L nm 

The coefficients with either n or m even are 0. Note that the coefficients C nm 
factorize into the form C n C m . In the yz plane (x = 0), i.e., the end cap in Fig. 
5.6, the double Fourier series reduces to the product of two single Fourier series, 
in y and z- The one-dimensional Fourier series with sine functions six\(nny/t) 
and coefficients 4/(nrr) for odd n, is a square wave with amplitude 1 and period 
2f . 8 Therefore V (x) obeys the boundary condition V (0, y, z) = Vo (a constant) 
for 0 < y < I 2 and 0 < z < iy. For small x, i.e., near the end cap, V (x, y, z) is 
approximately constant in y and z. For large x the Fourier series is dominated by 
a single term, because all the separable solutions decrease exponentially with x, 
with decay constants tc mn . The dominant term is the one with the smallest decay 

7 The orthogonality relation (5.39) is analogous to those in (5.24), but has a slightly different form 
because here the domain is (0, 12 ) rather than (—a, a). To prove (5.39), reexpress the integrand by 
using the trigonometric identity 2 sin A sin B = cos(/t — B) — cost A + B). 

8 See Exercise 4. 
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constant, namely the term n = m = 1. Thus the asymptotic potential, for large x, 
is 


V(x,y,z) « 



(5.42) 


decreasing exponentially with x. 

In an unbounded region the asymptotic potential of a charge distribution de¬ 
creases as Q/(4neor), or some other power of 1/r if the total charge Q is 0. In 
the grounded pipe the potential decreases exponentially as x -> oo. The reason 
for this marked difference is that in the latter case opposite charge is pulled from 
ground onto the inner walls of the pipe. The charge density on the walls is largest 
at small x. The potential due to the surface charge cancels the potential due to 
charge on the end cap in the limit of large x. This example illustrates the phe¬ 
nomenon that an electric field cannot penetrate far into a space with conducting 
walls. If the space is entirely enclosed, a configuration sometimes called a Fara¬ 
day cage, then the electric field inside is 0. If the space has a small opening, or 
if the potential is fixed at one end as for the rectangular pipe in Fig. 5.6, then the 
field decreases exponentially with distance from the end. 

Figures 5.7 and 5.8 illustrate the potential V (x) in the pipe. In these figures £2 
and £3 have been set equal to £, and distances are measured in units of t. Figure 
5.7 is a graph of the potential V (x, tz/2, £ 3 / 2 ) for points on the central axis of the 
pipe, as a function of distance x from the end. Note that the potential is plotted on a 
log scale. A straight line on a log plot implies exponential decay, so asymptotically 
the potential decreases exponentially. Figure 5.8 shows the equipotential curves 
on the vertical bisector plane z = £3 /2. It is left as an exercise 9 to sketch the 
corresponding electric field lines. 



FIGURE 5.7 Potential on the central axis V(x, tz/2, £ 3 / 2 ) of the rectangular pipe in 
Figure 5.6, as a function of x. The end potential (.t = 0) is a constant Vo- Note that V / Vq 
is plotted on a log scale. The dimensions £2 an d £3 have been set equal to £, and x is 
measured in units of i. 

9 See Exercise 6. 
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FIGURE 5.8 Equipotential curves on the vertical bisector plane z = £ 3/2 of the rect¬ 
angular pipe in Figure 5.6. The end potential (x = 0) is a constant Vo- The dimensions 
and 1 3 have been set equal to l, and x is measured in units of 1. The potential values are 
shown in units of Vq. 


These two examples show how the solution is expressed as a linear combina¬ 
tion of separable solutions. But in each case we used symmetries, and our knowl¬ 
edge of sinusoidal and hyperbolic functions, to simplify the construction of the 
solution. As usual in physics the calculation is not just a routine process, but a 
combination of imagination and analysis. 


5.2 ■ SEPARATION OF VARIABLES FOR SPHERICAL POLAR COORDINATES 


For a system with spherical geometry, it is only natural that we work in polar 
coordinates r,0,4>. That is, the potential function V(x) is written as a function 
V(r, 6 , </>), and Laplace’s equation is 


9 / 2 9VA J_ 1 9 / . dV_\ 1 d 2 V 

r 2 dr \ dr ) r 2 sin# 9(9 \ dO ) + sin 2 (9 dip 2 


(5.43) 


To keep the discussion relatively simple, we restrict our attention to systems with 
azimuthal symmetry, i.e., invariant with respect to rotations around the z axis. 
This restriction still includes many interesting problems. To extend the method 
to problems that lack azimuthal symmetry is possible but more complicated. For 
a system with azimuthal symmetry the potential does not depend on <p\ that is, 
V(x) = V(r, 6 ). Then the last term on the left-hand side of (5.43) is 0. 


5.2.1 ■ Separable Solutions for Spherical Coordinates 

The key to separation of variables is to generate all solutions with the product 
form, as we did for Cartesian coordinates in Sec. 5.1. For the present case a prod¬ 
uct solution is 
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V(r,9) = R(r)®(9). 


(5.44) 


Inserting this form into (5.43) and multiplying the result by r 2 /(R@), we find 


1 d ( 2 dR\ 1 d 

R dr V dr J 0sin 0 dO 



= 0 . 


(5.45) 


The two terms depend on r and 9 separately, and sum to 0 for all r and 9. Clearly 
this is only possible if both terms are constants that sum to 0, i.e., C and —C 
respectively. It is convenient to denote the constant C by £(£ + 1). The functions 
R(r) and 0(0) must satisfy the ordinary differential equations 


57 ( r2 ^) _ ^ e+l)R = °’ (5.46) 

and 

1 d ( . d®\ 

— +£(€ + 1)0 = O - (5-47) 

sinf? dd \ dO J 

Later we shall see that £ must be an integer in order that the solution of (5.47) is 
finite in the whole domain of 9. The quantity 1(1 + 1) is the same for l = n or 
l = — n — 1 , so without loss of generality we may specify that l is nonnegative. 

Two independent solutions of the radial equation (5.46) are r l and r~ f ~ ] , as 
can easily be verified. The most general solution is a linear combination 


R(r) = Ar i + JL (5.48) 

where A and B are constants that will eventually be determined by boundary con¬ 
ditions. The two radial solutions have very different characters. We have specified 
that £ > 0. Therefore the radial function r l is finite at r = 0 but diverges as 
r —>■ oo. The function r~ e ~ l diverges at r = 0 but approaches 0 as r —> oo. 10 
The behavior at r = 0 or r = oo will often dictate that either B or A must be 0. 
A finite solution for the potential in a region that includes the origin must have 
B = 0, and a finite solution in a region that extends to oo must have A = 0. 

The angular equation (5.47) may be cast in the form of Legendre’s equation. 
Rewrite the equation in terms of the variable u = cos 9, and let 0(0) = Piu). 
Note that 

d® dP du . dP j -j dP 

d9 du d9 du du 

Therefore the equation obeyed by P(u) is 

(\-u 2 )~\+£(£ + l)P =0, (5.49) 

du J 

10 A solution of a differential equation that is finite in some domain of the variable is called a regular 
solution. 


d 

du 
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which is Legendre’s equation. The domain of u is (—1, 1). Equation (5.49) has 
two linearly independent solutions for any t. However, we are only interested in 
regw/ar solutions, i.e., that are finite for — 1 <u< 1. As we shall see, the require¬ 
ment that P(u) be regular places severe restrictions on the solutions. Equation 
(5.49) has a regular solution only if t is an integer, and the regular solution is sim¬ 
ply a polynomial of degree £ called the Legendre polynomial. To get acquainted 
with the Legendre polynomials, a brief mathematical digression is in order. 

5.2.2 ■ Legendre Polynomials 

We may construct the solution of (5.49) by writing P(u) as a power series 

OO 

P(u) = J2 c nU n • (5.50) 

n= 0 

Inserting the series into (5.49) and combining the terms with equal powers of u, 
we find 

OO 

[ C n+2 (n + 2)(n + 1) - C„n(n + 1) + C n £(£ + 1)] u n = 0, (5.51) 

n=0 

which must hold for all u. The only way a convergent power series can be 0 for 
all values of the variable is if the coefficients are 0. The fact that the coefficient in 
square brackets must be 0 , for all n, leads to the recursion relation 

C n +2 _ n(n+ 1 ) -l{t + 1 ) 

C n (n + 2)(n+l) ‘ ' 

Given Co the recursion relation determines C 2 , C 4 ,...; or, given C 1 it determines 
C 3 , C 5 ,.... The solutions may be divided into two classes: even functions of 
u, for which C\, C 3 , C 5 ,... are 0; and odd functions, for which C 2 , C 4 , C 6 ,... 
are 0 . 

To use the series (5.50) for the 0-dependence of V(r, 6 ), it must be finite in 
the entire domain 0 < 6 < n. In other words, P(u) must be a regular solution 
of Legendre’s equation, finite for — 1 < u < 1. We shall prove that the series 
must terminate for the solution to be regular. That is, all the C„' s must be 0 for n 
greater than some maximum value n max . Suppose the series does not terminate. 


TABLE 5.1 

Legendre polynomials 


Po(u) = 1 
P\(u) = u 

P 2 (u) = (3 u 2 - l)/2 
P 3 («) = (5 m 3 — 3m)/2 
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Then the recursion relation (5.52) requires C„+ 2 / C n —> 1 as n -* 00 . But in that 
case the coefficients approach a constant as n —> 00 , and the power series will 
diverge at u = ±1, so the solution is not regular. 

By the recursion relation (5.52) the series terminates at n max , giving a regular 
solution, if £ = n max ; because then the right-hand side of (5.52) vanishes for 

n = ttmax- 

The result of the power series analysis is that Legendre’s equation has a reg¬ 
ular solution if and only if i is an integer, in which case P(u) is a polynomial 
of degree L The Legendre polynomials Pe(u) are conventionally normalized to 
have Pe( 1) = 1. We leave it as an exercise 11 to generate the first few Legendre 
polynomials from the recursion relation. Examples are listed in Table 5.1, and 
plotted in Fig. 5.9. Note that the parity of Pe(u) (i.e., the evenness or oddness as 
a function of u) is ( —l/. Some mathematical software packages have Legendre 
polynomials built in, so it is easy to look up higher polynomials. 12 

5.2.3 ■ Examples with Spherical Boundaries 

From the previous section we know how to write a regular, azimuthally symmetric 
solution to Laplace’s equation, in the form of a linear combination of separable 
solutions, 


V(r, 0)=J2 Utr* + Ber-*- 1 ) Pe(cos6). (5.53) 

e=o v 2 


Indeed any regular solution may be expanded in this way, a fact known as the 
11 See Exercise 10. 

12 Also, there are mathematical handbooks, such as Refs. 5-7, that provide information on Legendre 
polynomials and other special functions. 
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completeness theorem of Legendre polynomials. The expansion coefficients At 
and Be will be determined from the boundary conditions of the problem, as illus¬ 
trated in the examples below. 

The first few separable solutions are potential functions that we have encoun¬ 
tered before. For £ = 0 the two solutions are Ao and Bo/r; the first of these is the 
constant that may always be added to the potential, and the second is the potential 
for a point charge at the origin. For l = 1 the two solutions are Air cos 9 and 
B\ cos 9/ r 2 ; the first of these is the potential for a constant field in the z direction, 
and the second is the potential for a point dipole oriented in the z direction. Recall 
that in Sec. 4.3 we analyzed some simple examples using linear combinations of 
just these four solutions. By including all values of £ we can solve any problem 
with azimuthal symmetry. 


EXAMPLE 3 Suppose a spherical surface of radius R is held at a specified po¬ 
tential Vo($), which is an azimuthally symmetric function of the polar angle 9. 
(The sphere cannot be a single conductor, because then it would be an equipoten- 
tial, but it could be built out of sections of a thin conducting shell insulated from 
each other as in the special case considered below.) If there is no charge inside or 
outside the sphere, then the potential function V (r, 6 ) satisfies Laplace’s equation 
V 2 V = 0 for r < R and r > R, with the boundary conditions V(R, 9) = Vq(6 ) 
and V (oo, 9) = 0. This example is a case of Dirichlet boundary conditions. 

We expand V (r, 9) in separable functions, in both the interior and exterior 
regions, as 

00 

Vint (r, 60 = 

t= 0 

00 

V e xt (r,9) = ^ 2 at 
1=0 

Writing the potential in this compact form, we have already used several ideas. 
In the interior (r < R) the radial function for angular mode £ must be i J so 
that V ( r , 9) is finite as r —» 0. In the exterior ( r > R) the radial function for 
angular mode £ must be r~ e ~ l so that V(r, 9) —*■ 0 as r —*■ oo. For convenience 
we have denoted the expansion coefficents by ae/R l and aeR e+l in the interior 
and exterior regions, respectively. The constants at are the same for the interior 
and exterior solutions, because the potential must be continuous at the boundary 
(r = R). The at' s will be determined from the potential on the sphere (r = R) 

oo 

V 0 (9) = £aePe(cos9). (5.56) 

i =o 

The set of Legendre polynomials is an example of orthogonal polynomials. 
With the conventional normalization the orthogonality relation is 


P e (co&9) for r < R, (5.54) 

fR \ i+1 

f— j Pe(cos 6 ) for r > R. (5.55) 
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FIGURE 5.10 Example 3. The upper and lower hemispherical shells are held at constant 
equal but opposite potentials. 


J 'P k (u)P e (u)du = ^ -, (5.57) 

where 8 k ( is the Kronecker delta. Or, in terms of the polar angle, replacing u by 
cos 9, 


[ P k (cos9)Pc(cos6) sin 6 d 6 = (5.58) 

Jo 2 k + 1 

To use the orthogonality to find a k , multiply both sides of (5.56) by P k (cos 6) sin 9 
and integrate 9 from 0 to n. Then the only term in the sum on the right-hand side 
that is not 0 is the one with £ = k, and that term is 2a k /{2k + 1). Therefore , 13 

2 k + 1 f 71 

a k = —-— / P k (cos9)Vo(9) sin9d9. (5.59) 

2 Jo 

This holds for all k, so we can replace k by £ for use in (5.56). In principle the 
problem is solved: Given Vo(9) the aps are known from this integral, and they 
determine V (r, 9) by the series in (5.54) and (5.55). 

As a specific case, suppose Vo (9) is 


Vo (9) = 


+ $0 
—$0 


for 0 < 9 < n/2 
for n/2 < 9 < n 


(5.60) 


as illustrated in Fig. 5.10. This electrostatic system consists of two thin hemi¬ 
spherical shells of a conducting material, glued together to make a sphere but 
insulated from one another by a thin insulating layer, and held at equal but oppo- 

13 Colloquially, we refer to this kind of calculation as “projecting out the Lth coefficient.” 
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FIGURE 5.11 Legendre expansion for the potential (5.60). Two truncations of the se¬ 
ries (5.56) are plotted, with maximum t of 1 and 9. The infinite series is a discontinuous 
function equal to +<I>o fort? in (0, rr/2) and — <I>o fort? in (jr/2, n), shown as the dashed 


site potentials. The expansion coefficient for angular mode f is 


at 


21 + \ 




f 1 

r 0 

/ Pe(u)du — 

/ Pi(u)du 

Jo J 

'-1 


(5.61) 


If i is an even integer, then a( = 0 because Pf(u) has even parity, so that the 
integrals over (0, 1) and ( — 1,0) cancel. If i is odd, then P{ (u) has odd parity, so 
the two integrals add. To evaluate the integral Pfdu, integrate both sides of 
Legendre’s equation (5.49) over the range 0 < u < 1. The result is 14 


f 


Pdu) du 


pm 

ni+ 1 )’ 


(5.62) 


where P' t ( 0) means the derivative of Pf+u) evaluated at u = 0. So, for odd integer 
values of l. 


at 


21+1 
1(1 + 1 ) 


^(0)<fo- 


(5.63) 


The first three expansion coefficients in the series of (5.54) and (5.55) are 

ai,a 3 ,a 5 = §<t> 0 , — g4>0, ^4>0- (5-64) 


For r near R, V(r,9) resembles the function Vo(0), being « +4>o in the up¬ 
per hemisphere and % — 4>o in the lower hemisphere. Figure 5.11 shows how the 
Legendre expansion (5.56) produces this discontinuous function of 9. The func¬ 
tions plotted there are the first term in the expansion, which is just 1.5 cos 9 ; and 
the sum of the first 10 terms, which resembles the step potential Vo(0) except for 
some oscillations. To get rid of the oscillations requires the full infinite series. 
Terms with large values of l are important if r is near R. But for r R or r R 

14 See Exercise 10. 
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the series is dominated by the single term with l = 1. In particular, the asymp¬ 
totic potential for large r is the first term in the series (5.55), which is the dipole 
potential 


ai/? 2 cos# p cos 9 

r 2 47reor 2 ’ 


the dipole moment of this bisected sphere is pk, with 

p — AneoaiR 2 = 6 ne^oR 2 . (5.65) 

The terms with higher values of l, which are proportional to r~ 4 , r~ 6 , r~ 8 , 
are the higher multipole contributions to V (r, 9). 

Now that we have found the exterior and interior potentials for the hemi¬ 
spheres, it is a natural next step to calculate their capacitance C, which has an 
interesting result. Recall that C is the ratio of the charge on one conductor to the 
potential difference between them. The potential difference here is 2<t>o- We’ll ob¬ 
tain the charge by integrating the charge density a (9) over the upper hemisphere. 

Use the general boundary condition a = eo(£i« — £ 2 *)- In this case, the 
normal direction is radial, so 


o{9) = e 0 


9 Text 3 Vint' 
dr dr 


using the Legendre expansions (5.54) and (5.55) gives 


(5.66) 


a(9) = 


eo'J’o 

R 


E 

{.odd 


( 2 1 + l ) 2 . 

2 -— P (0) Pi (cos 9). 

l{l+ 1 ) ' 


(5.67) 


The total charge on the upper hemisphere is then 

Q = 2nR 2 I a(9)sin9d9. (5.68) 

Jo 

To do the integral, let cos 9 = u and refer back to (5.62). Finally the capacitance 
is 


C = J-= 7re 0 R T 

2 °o 6 dd 


21 +1 

LW + U 


-|2 


pUo) 


(5.69) 


Although the separation between the hemispheres is infinitesimal at the equator, 
C is finite. In contrast, a parallel plate capacitor with infinitesimal plate separa¬ 
tion would have infinite C. The reason C is finite for the hemispheres is that we 
approximated the thickness of the shells by 0, so that the area of infinitesimal 
separation is 0. 
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Because of spherical symmetry, we have been able to do a lot with this prob¬ 
lem. Indeed all of our analytic techniques require some degree of symmetry for 
the problem to be tractable. Even small changes in the system could make the 
problem much harder, or impossible, to solve analytically. For example, if the 
hemispheres were displaced horizontally, or tipped relative to one another, the 
Legendre expansion would not suffice to solve the problem. To find V (x), E(x) 
and C for such unsymmetric problems would require numerical methods or direct 
measurement. 


EXAMPLE 4 Consider again a sphere, but now instead of specifying the potential, 
specify the charge on the surface. Let the azimuthally symmetric surface density 
be oo(9). What is the potential V(r, 0)1 

The same expansions in separable functions (5.54) and (5.55) can be used, be¬ 
cause they are general solutions of Laplace’s equation, but we must recalculate the 
coefficients . The boundary condition in this case is that the normal component 
of the electric field is discontinuous at the sphere, with discontinuity ao(9)/eo', 
that is, 


3 V ex , 

, 3V in ,l| 

3r 

+ ^rjl 


go(fl) 
eo 


(5.70) 


the same as (5.66) with surface density cro(O). This case is an example with Neu¬ 
mann boundary conditions. Inserting the expansions (5.54) and (5.55) into the 
boundary condition gives 


00 

£(2* + 1)-£p<(cos#) 
e=o K 


gp(0) 

eo 


(5.71) 


As before, we use the orthogonality relation (5.58) to project out the Ath coeffi¬ 
cient from the expansion, and here the result is 

r r n 

ak = — / ao(0)Pk(cosd) sinOdO. (5.72) 

2eo Jo 

This expression is valid for any k so we can replace k by i for use in (5.54) and 
(5.55). 


Special case. Suppose the surface density is ao(9) = a cos 9, where a is a 
constant. The total charge for this distribution is 0, with positive charge above the 
equator and negative charge below. The surface density is large and positive at the 
north pole, large and negative on the south pole, and zero at the equator of the 
sphere. Because cos# = Pi (cos#), the orthogonality of Legendre polynomials 
implies that ak = 0 for k 1, and a\ = a R/ (3<?o). The potential function is 

Vjnt(r, #) = -^—r cos# for r < R, 

3e 0 


(5.73) 
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FIGURE 5.12 Example 4. The curves are electric field lines inside and outside a sphere 
with surface charge density a cos 6 . The field is constant inside the sphere and a pure dipole 
outside. 


a R 3 

V« t (r, 0) = - - T cos <9 for r > R. (5.74) 

3e 0 r 2 

The electric field inside the sphere is uniform, equal to —ka/(3eo). The electric 
field outside the sphere is a pure dipole field, with dipole moment p = |jr /? 3 nk. 
The electric field lines for this system are shown in Fig. 5.12. Note that the normal 
component of the field is discontinuous at the surface. 

In Example 4 of Chapter 4 we calculated the surface density of the charge 
induced on a conducting sphere placed in a uniform applied field £ok. The result 
for that example is a(Q) = 3eo£ocos0. If in our current example we set a = 
3« <)£(), then the field produced inside the sphere would be — E’ok. The electric field 
inside a conducting sphere in an applied field is the superposition of the applied 
field Eok and the field produced by the surface charge a cos 6 ; the combined field 
is 0, as the field must be in a conductor. 


EXAMPLE 5 The Coulomb potential. Consider a point charge q displaced from 
the origin. We may set up a coordinate system with the charge on the z axis, as 
shown in Fig. 5.13. What is the potential due to this displaced charge in spherical 
polar coordinates? 

The potential V(x) will be independent of the azimuthal angle cp of the field 
point x, and it will satisfy Laplace’s equation everywhere except at the singular 
point x' where q is located. By the completeness of Legendre polynomials, we 
can write the potential in spherical coordinates as a Legendre series 

OQ 

V(r, 6) =y^ Ri(r, r')P e (cos0), 
e =o 


(5.75) 
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FIGURE 5.13 Example 5. The potential at x due to a point charge q at x' on the z axis 
can be expanded in Legendre polynomials when expressed in spherical polar coordinates 
(r, 8). The distance from q to the field point is |x — x'| = \/r 2 — 2 rr' cos 8 + r' 2 . 


where r = |x| and r' = |x'|, and Ri(r, r') is a function we will determine. Of 
course we know the potential; it is just the Coulomb potential 


V(x) = 


q 

4:reo|x — x'| 


_q _ i _ 

Vr 2 — 2 rr' cos 6 + r' 2 


(5.76) 


The question is, what are the radial functions Rt(r, r') in the Legendre expansion? 
To answer the question we need a famous function in the theory of Legendre 
polynomials. 


The Generating Function of Legendre Polynomials 

We begin somewhat indirectly by defining, as a purely mathematical object, a 
function of two variables, by 


00 

g(u,s) = '^2,Pi{u)s t . 


£=0 


(5.77) 


In words, it is a power series in .y whose coefficients are the Legendre polynomials. 
In general s may be complex, but that is not important here. The function g(u, s) is 
calledthe generating function because the Legendre polynomials can be generated 
from it by differentialion; note that 


- n! P n (u). (5.78) 

s= 0 

If we find a simple expression for g(u, s) then it determines in principle all the 
P,MYs. Now, g(u, s) satisfies a certain partial differential equation. By multiply¬ 
ing Legendre’s equation (5.49) by s e and summing over £, note that 



9 

du 



dg' 

du 


= -J2Uf + l )Pe(u)s e ; 
t 


(5.79) 
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but the right-hand side may be reexpressed as 




3 u |_V ' 3mJ 


a 2 

+ S 3? <IS)= °' 


(5.80) 


Thus g(u, s) obeys the partial differential equation 

(5.81) 

Also, there are some simple boundary values of g(u, s). From (5.77) we see that 


g(u, 0) = P 0 (u) = 1, 

7 p(«, 0) = P\(u) = u, 
ds 

£(1> s ) = XX = T - "—’ 

to l ~ s 

00 Y 

g(-l,i) = X)(- 1 )V = —. (5.82) 

1 + s 


For the last two equations |s | must be < 1 in order that the series converge. 
The solution to the equation (5.81) with the boundary values (5.82) is 


g(u, s) 


1 

vT— 2 us + s 2 


(5.83) 


To verify that this function satisfies (5.81) is straightforward. The domain of va¬ 
lidity of the expansion (5.77) is |.s| < 1, and —1 < u < 1 because we associate u 
with cos 0. The fact that |j| must be < 1 will be important; if |.v | > 1 then (5.77) 
is a divergent series. 

The Legendre polynomials P( (u) and their generating function g (u, s ) are just 
one example in a general mathematics of orthogonal polynomials. Other exam¬ 
ples, which a physics student will certainly encounter in quantum mechanics, are 
the Laguerre polynomials and the Hermite polynomials [5-7]. 


We are now ready to return to Example 5, and write the Coulomb potential in 
Legendre polynomials; that is, we will find the radial function R( (r, r') in (5.75). 
For r < r'. 


1 


Ix-x'l 


l_ _1_ 

r ' y^l^^TTr^ycos^+TrT^O 2 


1 

? 


g(cos6, r/r ). 


(5.84) 


We have factored out 1/r' so that the variable r/r' in the square root is < 1, 
which is the condition needed to write g(cos6, r/r') as the series in (5.77) with 
s = r/r'. Similarly, for r > r', we have 
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--- =- - - Ipfcosfl, r'/r). (5.85) 

l*-x'l r yi -2(r'/r)cose + (r'/r) 2 r 


Now substitute the expansion (5.77) into (5.84) or (5.85). Comparing to (5.75) we 
see that the radial function Rt(r, r ) is, for r < r'. 


Rt(r, r') 


47T€q (r') M ’ 


and for r > r', 


Rdr, r') 


q (r') e 
4^-eo r f+1 ’ 


(5.86) 


(5.87) 


Note that R( (r, r') is symmetric with respect to the interchange of r and r’ because 
|x — x'| has that symmetry. Our final expression for the Coulomb potential in terms 
of Legendre polynomials is 


V(x) 


q 

4rre 0 


OO 


£ 


Pd cos 8) 


d/{r') lJr 1 
(r') e /r M 


for r <r' 
for r > r'. 


(5.88) 


The series is convergent for all points except (r, 8) = (r', 0), the singular point 
where q is located. 

We derived (5.88) by taking x' to be on the z axis; then 8 is the polar angle 
coordinate of x with respect to that axis. However, the equation holds for arbitrary 
x' and x, i.e., independent of the choice of axes, if 8 is the angle between the 
vectors x and x'. 

It was stated earlier that any azimuthally symmetric solution of Laplace’s equa¬ 
tion may be expanded in Legendre polynomials. Equation (5.88) is an example of 
that statement. Furthermore, 1 /(4 tt |x — x'|) is the Green’s function G(x, x') of 
the Laplacian, which is used to construct solutions of Poisson’s equation for a 
specified charge density. 


5.3 ■ SEPARATION OF VARIABLES FOR CYLINDRICAL COORDINATES 

When a problem has cylindrical symmetry the natural coordinate system to use is 
cylindrical coordinates r, (p, z- 15 This is appropriate if, for example, the equipo- 
tential surfaces or the surfaces on which the charge is known are cylinders. In 
these coordinates Laplace’s equation, satisfied by the potential V(x) = V (r, cp , z), 
is 


1 9 / dV \ 1 9 2 V a 2 v 

r dr \ dr ) ^ r 2 dtp 2 dz 2 


(5.89) 


15 Whether to denote the angular variable by 0 or by 8 is a matter of convention. In this section we 
follow the notation of Chapter 2, and call it 4 >. 
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We will restrict our discussion to problems that are translationally invariant in the 
z direction; then V does does not depend on z. 

5.3.1 ■ Separable Solutions for Cylindrical Coordinates 

As before, we begin by considering the separable solutions. In cylindrical coor¬ 
dinates, with no z dependence, these take the form 

V(r,(P) = fl(r)<t>(0). (5.90) 

Substitute (5.90) into (5.89), and note that the term d 2 V/dz 2 is 0. Then multiply 
the result by r 2 /R(r)<t>((p ) in order to separate terms that depend only on r from 
those that depend only on (p. The result is 



This equation must hold for all r and (p, and these are independent variables, 
so each term must be constant. We also add the requirement that the potential 
be periodic in (p with period 2n, i.e., we require V(r,<p + 2 it) = V(r, <p). The 
periodicity implies we must set 


1 d 2 <t> 


(5.92) 


with n a nonnegative integer. The general solution for the <p dependence is 


<$>(<p) = C„ cos n<p + D n sin ncp, 


(5.93) 


where C n and D n are arbitrary constants. 

We do not need to allow n < 0 because negative values are already included 
in the form (5.93). The case n = 0 is somewhat special. It would include <t>(<p) 
a constant, which is relevant to problems with azimuthal invariance. A general 
solution with n = 0 is = a<p + b, which is not periodic, but is relevant to some 
problems. 16 

Next, to find the form of R(r) we replace the second term in (5.91) by —n 2 . 
Then the radial equation may be rewritten in the form 


,d 2 R 
dr 2 


dR 

dr 


(5.94) 


The general solution to this second-order ordinary differential equation is 


R(r) = A n r n + B„r~ n (5.95) 

,6 The potential V(<j>) = Vo0/0o is the potential in the wedge-shaped volume between the planes 
d> = 0 and 0 = 0o >f the plane 0 = 0 is held at ground potential and the plane 0o is held at potential 
Vo - In this case the potential isn’t required to be periodic because the range of 0 is limited. 
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where A n and B n are additional arbitrary constants. To prove that this is the 
solution, just substitute it into (5.94). In a particular application, the constants 
A n , B n , C n , and D n will be determined by boundary conditions. The case n = 0 
is again special. Its solution is R = A In r + B. We have met this important solu¬ 
tion before as the potential of a uniform line charge. 

Finally then we have the most general periodic solution for V (r, cp), 

oo 

V(r, cp) = A lnr + B + ^ ( A n r n + B n r~ n ) ( C n cos ncp + D n sin ncp). (5.96) 

n =l 

The cp dependent functions are called zonal harmonics, and they deserve some 
general discussion. For systems in which the potential is by some symmetry even 
in (p, we only use the functions cos n<p; that is, D n = 0. Similarly, for systems in 
which the symmetry is odd in cp, we only use the functions sin n<p. If the system 
has neither symmetry, then use the linear combination (C„ cos ncp + D n sin n<p). 

The terms with n = 1 and finite at the origin are proportional to rcoscp and 
r sirup, which describe uniform fields in the x direction and y direction, respec¬ 
tively. The field E = Eq i has potential function —E^r cos (p = —Eqx\ similarly 
E = £o j has potential function — E^r sin <p = —Eoy. Terms proportional to r" 
are useful for interior regions, i.e., which include the origin, because r n is non¬ 
singular at r = 0. Terms proportional to r~ n are useful for exterior regions, i.e., 
which include r = oo. In an annular region R\ < r < R 2 , which excludes r = 0 
and r = 00 , the linear combination (A n r n + B n r~ n ) must be used. 


EXAMPLE 6 Figure 5.14(a) shows two half-cylinder shells that are the right and 
left halves of a cylinder with radius R. The shells’ thickness is negligible com¬ 
pared to R, and they are made of a conducting material. They are separated from 
each other step = n/2 and <p — 3n/2 by small insulating gaps. The left half, for 
which n/2 < <p < 3n/2, is held at potential — Vo, and the right half, which has 
0 < <p < 7t/ 2 or 3 tt/2 < <p < 2n, is held at +Vo- We will find the potential 
function V (r, cp) inside and outside the cylinder. 

We start with (5.96) but needn’t use all the terms. Because of the left-right 
antisymmetry, the net charge on the cylinder is 0, and this means the term A In r 
does not appear in V (r, cp). The system has even symmetry in cp, so the terms with 
sin ncp do not appear. In the interior the terms oc r~ n do not appear, because they 
would be singular at r — 0, and in the exterior the terms oc r" do not appear. 
Thus, for the interior (r < R) and exterior (r > R), respectively, the potential is 

OO 

Fint (r, <P) = ^2 Anr n cos ncp (5.97) 

«=1 

OO 

V ext (r, <p) = '^2 B n r~ n cos ncp. 

n =1 


(5.98) 
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FIGURE 5.14 Example 6. (a) Two thin conducting half cylinders, the right held at po¬ 
tential +V 0 an d the left at potential —Vq. The shells are separated by insulating gaps at 
<j> — n/2 and <j> — 3n/2. (b) The potential at the surface r = R of the bisected cylinder. 


+ V 0 

-Vo 


Here is an often-used mathematical shortcut. (We used it in spherical coordi¬ 
nates in Example 3, and now we’ll use it in cylindrical coordinates.) At r — R 
we have the boundary condition Vi„t (R, 0) = V eu (R, 9), which implies A n R n = 
B n R~ n = c n , a new constant for each n. Then the series take the more convenient 
form 


oo 

Vint(r,<A) = (^) 

n— 1 

n 

cos n(j) 

(5.99) 

°° / R \ 

n 


Vext( r, (p) = ^ C n ( — 
n=l ' ' 

On the surface of the cylinder, 

00 

| COS 71(f). 

(5.100) 

V(R , 0) = c n cos n(p. 

n = 1 

(5.101) 


Equation (5.101) is a Fourier expansion of the periodic potential shown in Fig. 
5.14(b). We obtain the coefficients c„ in the usual way from an orthogonality 
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relation. Multiplying both sides of (5.101) by cos m<p and integrating from 0 = 0 
to 0 = 2 jt gives 



V(R,<p) cos w0d0 = c m n 


(5.102) 


because the only term in the series that contributes to the integral is the term with 
n — m, by the orthogonality relation 


p2 7t 

I cos n<p cos m<p d(p = n8 nm . (5.103) 

Jo 

Now we substitute the function V ( R , 0), which is piecewise constant (—Vo for 0 
in (n/2, 3n/2) and + Vq in the rest of the domain of integration), and after a little 
algebra we find 


Cm 


4 Vo . inn 

-sin—. 

nm L 


(5.104) 


Thus c m is 0 if m is even; the sign is + for m — 1, 5, 9,..., or the sign is — for 

m = 3, 7, 11_To simplify the notation we will write n = 2j + 1 and sum over 

j in the series (5.99) and (5.100). 

The final series for the complete potential are 


Vim (r, 0) 


4Vo (-1 )J 

* h 2j+1 


r \2;+l 


© 


cos(2 7 + 1)0 


Vext(r> 0) 


4V 0 ” (-1)2 

" h 2j+1 


(*r 


cos(2 j + 1)0. 


(5.105) 

(5.106) 


5.4 ■ CONJUGATE FUNCTIONS IN 2 DIMENSIONS 


An electrostatics problem in which the potential is translationally invariant in one 
direction, say the z direction, is called a two-dimensional (2D) problem. The po¬ 
tential is then V(x, y) and Laplace’s equation is 


d 2 v a 2 v 

dx 2 dy 2 


(5.107) 


Although (5.107) is an idealization—it implies the system is infinite in the z 
direction—it is a good approximation for many real electrostatic systems in which 
the extent of one dimension (z) is much larger than the others (x, y). 

Some 2D problems can be solved by separation of variables, e.g., in Cartesian 
or polar coordinates. However, there is another approach for 2D problems, called 
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the method of conjugate functions, that solves Laplace’s equation by combining 
x and y in a complex variable. This section describes the method and applies it 
to several problems in which we find V(x, y) for charged 2D conductors. The 
mathematics is intricate, but it yields interesting results. 

The central idea is to find two functions g(x,y) and h(x, y) that are conjugate 
to one another—a relationship we will explore. Once found, one function is the 
potential and the other describes the field lines, which are everywhere perpendic¬ 
ular to the equipotentials as always. The method is based on the mathematical 
theory of functions of a complex variable, and it brings to bear on electrostatics 
some of the mathematical elegance and power of this large subject [1,2]. 

The general 2D problem is to find V(x, y) for specified boundary conditions. 
Picture the coordinate plane as the plane of complex numbers, by writing 17 


Z = x + iy 


(5.108) 


where i = V—1. In words, z is a complex number with real part x and imaginary 
part y. The method of conjugate functions is based on the following theorem: If 
F(z) is an analytic function of z = x + iy, then Re F(z) and Im F(z.) satisfy 
Laplace’s equation (5.107). The theorem provides a method for generating solu¬ 
tions of Laplace’s equation. To solve a given boundary value problem it is “only” 
necessary to find the right function F(z), namely one whose real part or imagi¬ 
nary part satisfies the boundary conditions. This may not be as easy as it sounds, 
and indeed the main difficulty is to find the appropriate F(z). 

The theorem is not difficult to prove. Write F(z) = g(x, y) + ih(x, y), where 
g and h are real functions. We need to show that g and h are solutions of (5.107). 
By the chain rule of differentiation, 


similarly, 


Thus, 


— - — and — - —■ 
dx dz 3y dz' 


3 2 F _ d 2 F 3 2 F _ d 2 F 

dx 2 dz 2 3 y 2 dz 2 

V 2 _ 3 2 F 3 2 F _ d 2 F <PF_ _ 

dx 2 3y 2 dz 2 dz 2 


(5.109) 


(5.110) 


(5.111) 


that is, F(z) satisfies Laplace’s equation. Because V 2 is a real operator, both the 
real part g(x , y) and the imaginary part h(x , y) of F(z) satisfy Laplace’s equation. 

We may use either g(x, y) or h(x , y) as the potential of an electrostatics prob¬ 
lem, provided it satisfies the boundary conditions. Whichever we use, the other 


17 It requires some mental agility to not confuse the complex variable z = x + iy with the Cartesian 
coordinate Z- But the latter z never appears in a 2D problem. 
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function will tell us the electric field lines. Suppose g(x, y) is the potential. Then 
the equipotentials are curves on which g(x , y) is constant. We shall prove that the 
curves on which h(x, y) is constant are the electric field lines. 

Derivatives of g(x , y ) and h(x, y) are intimately related because F(z) is not a 
function of x and y independently, but of the special combination x + iy. Note 
that 


dF 

dx 

dF 

dy 


dg d_h 

dx dx 

dg ah 

dy 1 dy 


dF 
dz ’ 
dF 
1 dz 


(5.112) 


(5.113) 


Multiplying the second equation by —i, and then equating the real and imaginary 
parts of the result to those of the first equation, we see that 


dg dll dg dll 

dx dy dy dx ’ 


(5.114) 


These coupled partial differential equations are called the Cauchy-Riemann equa¬ 
tions. Any pair of functions g(x , y) and h(x , y) that satisfy them are called conju¬ 
gate functions. The real and imaginary parts of a complex function of z are always 
conjugates. 

If g(x , y) is the potential of an electrostatics problem, then the equation for an 
equipotential curve is g(x, y) = go. and so the displacements dx and dy along 
the curve are related by 


dg dg 

dg = f-dx + /dy = 0; 
ox dy 

the slope of the curve in the xy plane is 

/dy\ dg/dx 

\dx) g dg/dy ' 

Similarly, the slope of the curve h(x, y) = ho, a constant, is 

(dy\ dh/dx = dg/dy 

\dx) h dh/dy 'dg/dx' 


(5.115) 


(5.116) 


(5.117) 


where the second equality follows from the Cauchy-Riemann equations. Compar¬ 
ing the two slopes shows that the two curves are orthogonal where they cross, 
because the slope of one is the negative reciprocal of the slope of the other: 


dy\ -1 

dx) h ( dy/dx) g 


(5.118) 


In conclusion, the family of curves with h(x , y) constant are the electric field 
lines—curves everywhere orthogonal to the equipotentials. 
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We will now apply these ideas to three problems that are physically interesting, 
and are quite different from any problems studied so far in that they have less 
symmetry. The solutions are far from obvious, but can be found with conjugate 
functions. 


EXAMPLE 7 What is the potential function V(x, y) for a region bounded by 
charged conducting plates that intersect at a right angle? The plates are in con¬ 
tact, so they have the same potential. 

The geometry is shown in Fig. 5.15. The half of the xz plane (y = 0) with 
x > 0 is one boundary, and the half of the yz plane (x = 0) with y > 0 is the 
other boundary. Assume that the plates extend a large distance in z, so that the 2D 
approximation is valid. The boundary condition is that V (x, y) is a constant on 
the two half planes, because the plates are conductors. We will specify V = <J>o 
on the boundary, but the precise value of Oo is not particularly relevant because 
we can always add a constant to the potential. 

The complex function that solves this example is 


F(z) = Az 2 + i< t>o = A(x 2 — y 2 ) + liAxy + i< l>o, (5.119) 



FIGURE 5.15 Example 7: Perpendicular charged conducting plates. The system is 
translation invariant in the z direction. Ideally the plates extend to oo in the positive x 
and y directions. The plates, at potential 4 >q, are half-planes shown on the positive x and y 
axes. The solid curves are electric field lines and the dashed curves are equipotentials. 
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where we use the imaginary part for the potential 

V(x,y) = 2Axy + <P 0 - (5.120) 

To motivate this choice of F(z), note that z 2 is real on both the x and y axes (but 
nowhere else) so Im F is a constant, <J>o, on the axes. But that’s just the boundary 
condition we want for V. So by taking V = Im F we have V (0, y) = V (x, 0) = 
<J>o as required. The constant A will be interpreted later. 

The equipotentials are curves with xy = constant, which are hyperbolas with 
asymptotes along the axes (the dashed curves in Fig. 5.15). The electric field lines 
are curves with x 2 — y 2 = constant, which are hyperbolas with asymptotes at 45° 
to the axes (the solid curves in Fig. 5.15). 

The constant A is related to the surface charge density a on the conductor. On 
the boundary a is €qE„. For example, on the horizontal conducting plate (y = 0) 
the surface charge density is 


er(x) = €oE y (x, 0) = — 2Aeox. (5.121) 

The density is 0 at the intersection of the plates, and increases in magnitude lin¬ 
early with x. The charge on the vertical plate is analogous. Negative A corre¬ 
sponds to positive charge on the plates; in this case the electric field lines flow 
away from the conductors, terminating on negative charges. 

Previously, we studied a charged plane. In that most symmetric problem E is 
uniform, in the normal direction, and a is constant. Now imagine that the plane is 
bent in a right angle. The result is the problem we have just solved. The field and 
charge density are nonuniform. 

We have taken the plates to be infinite in the positive x and y directions, which 
is why the total charge is infinite. In any real system, the extension of the plates in 
the x and y directions would be finite, the amount of charge would be finite, and 
the potential would approach 0 at infinity. The constant <J>o that we are using in 
this example is the potential difference between the plates and infinity. For finite 
plates, Fig. 5.15 would be accurate in the region near the intersection and far from 
the outer edges. 18 


EXAMPLE 8 What is the potential function V(x,y) near the edge of a charged 
rectangular conducting plate? 

The geometry is shown in Fig. 5.16. The plate lies on the half of the xz plane 
(y = 0) with x > 0. The potential must be a constant <J>o on this boundary, 
because it is a conductor. The complex function that solves this example is 19 

18 See Exercise 19. 

19 For mavens of complex analysis, the branchcutof t}F is along the positive real axis. See [1]. 
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FIGURE 5.16 Example 8: Electric potential and field at the edge of a charged conduct¬ 
ing plate. The plate is the half-plane y = 0, x > 0, shown on the positive x axis. The solid 
curves are electric field lines and the dashed curves are equipotentials. 


F(z) = Az 1/2 + 


(5.122) 


where the imaginary part of F(z) is the potential. To motivate this choice of F(z), 
note that z 1 ^ 2 is purely real on the positive x axis, so Im F is a constant, d>o, there. 
That is exactly the boundary condition we want for V. 

Some complex analysis is required to express F(z) as a function of x and y. 
Write z = re' 6 where tan 6 = y/x, and r = jx 2 + y 2 ; then 

F(z) = Ar l/2 e ie/2 + i<P 0 

= A(x 2 + y 2 ) 1 / 4 [cos(9/2 + i sin#/2] + i<t> 0 (5.123) 

= g(x,y) + iV(x,y). (5.124) 


Using trigonometric identities it can be shown that 


A 


- \ 1/2 

x 2 + y 2 — x ] + d>o 


1/2 


x 2 + y 2 +x 


(5.125) 

(5.126) 


Note that the potential V (,r, 0) on the xz plane (y = 0) is <t>o for x > 0, but 
Ay/\x\ + $o for.r < 0. The equipotential curves are parabolas, shown in Fig. 5.16 



5.4 Conjugate Functions in 2 Dimensions 


169 


as dashed curves. The electric field lines are also parabolas, shown as solid curves. 
Because we have taken the plate to be infinite in the +x direction, y increases 
monotonically with x on an equipotential curve. For a plate that is finite in the 
x direction, i.e., a flat conducting strip along the z axis, the equipotential would 
curve back to the xz plane on the other side of the plate. The function F(z) of 
(5.123) is for a semi-infinite plate. The problem of a finite strip can also be solved 
by conjugate functions, but that is more complicated. 

It is interesting to calculate the surface charge density on the two sides of 
the conductor, from a = eo E„ , where E„ is —dV/dy on the upper surface and 
+3 V/dy on the lower. The charge density on the upper surface is 

er(.r) = eo lim E y (x, y) = — (5.127) 

y-»0+ ' V4x 

Some care is required in taking the limit. The charge density on the lower surface 
is the same. The calculated charge density approaches infinity at the edge of the 
plate x = 0, because we have neglected the thickness of the plate. For a plate 
of nonzero thickness the charge density would level off at a finite value as x 
decreases to 0. The relation a oc. x~ 1 ^ 2 would be a good approximation for a plate 
of finite length in the x direction (a strip) near the edge at x = 0. 

For an infinite charged plane—a highly symmetric system we studied pre¬ 
viously—there are no edge effects because there are no edges. Here we have 
calculated the edge effects for a semi-infinite plane! 


EXAMPLE 9 Fringing field of a parallel plate capacitor. Our final example is 
more complicated mathematically. Consider the function 20 


F(z) = *7— In ui = V(x, y) + ih(x, y), 
in 


where z is related to w by 


z 


lnut + -(1 — ur) 


(5.128) 


(5.129) 


Both w and z are complex variables. As usual z is x + iy. The other complex 
plane corresponds to w = u + iv, with horizontal axis u and vertical axis v. The 
transformation w —> z is an example of conformal mapping from the (w, v) plane 
to the (x, y) plane. 

In this example the real part of F(z) is the potential function V (x, y). There 
is no way to solve for w as an explicit function of z, so the functional relationship 
between F and z, or between V and x , y, is parametric. Nevertheless we can 

20 The branch cut of In w is along the positive real axis. The branch cut of a function of a complex 
variable is a curve along which the function has a discontinuity. 
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analyze the dependence and deduce what boundary value problem has F(z) as the 
solution. In a way we are working backwards, from the solution to the problem. 21 

We need to connect the coordinates x and y to the potential V(x, y) and the 
intensity function h(x, y). Write w = pe ,e . Then using In w = Inp + i9 and 

(5.129) it is straightforward to express x and y as 


d 

- 

1 / 

9 \ 

— 

In p 

+ - (1 - P 

2 cos 29 ) 

TC 

_ 

2 V 

/ 

d 


1 2 • 


— 

9 - 

-p l sin 29 


7T 


2 



(5.130) 

(5.131) 


The potential and intensity functions may also be written in terms of p and 9, as 

9 V 0 

V = Vo- and h = - — \np. (5.132) 

7T 7T 

Through these equations we have a parametric relation between (x , y) and V. 

The range of 9 is 0 to 2n, and the range of p is 0 to oo. Curves in the xy 
plane with constant p have constant h(x,y); these curves are the electric field 
lines, shown as solid curves in Fig. 5.17. These curves are generated by fixing p 
in (5.130) and (5.131) and varying 9 from 0 to 2n; the point (x, y) traces out a 


y 



FIGURE 5.17 Example 9: Electric potential and field at the edge of a parallel plate 
capacitor. The solid curves are electric field lines and the dashed curves are equipotentials. 

21 There is no simple way to motivate the function F(z ) so this is an “exploratory” example. However, 
the conformal map (5.129) can be analyzed (Exercise 23) to see why it is related to parallel plates. 
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curve. Curves with constant 6 are the equipotentials, shown as dashed curves in 
Fig. 5.17, generated in a similar way. The plots of curves in Fig. 5.17 are examples 
of parametric plots; y and x depend parametrically on the variables p and 8. As 
p varies with 8 fixed, the point (x, y) follows an equipotential in the xy plane; as 
8 varies with p fixed, the point (jc, y) follows an electric field line. 

Points with y = 0 and x < 0 have parametric variables 6 = 0 and 0 < p < oo; 
i.e., w is real and positive. The potential V (x, y) on this boundary is Oby (5.132). 
Points with y = d and x < 0 correspond to parameter values 8 = n and 0 < p < 
oo; in this case w is real and negative. The potential V (x , y) on this surface is Vo 
by (5.132). 

Now we can see what electrostatics problem is solved by the function F(z) in 
(5.128). The system is a parallel plate capacitor with potential difference Vo. for 
which the plates are half-planes parallel to the xz plane, at y = 0 and y = d. The 
edge of the capacitor is at x = 0, and the plates extend to — oo in the negative x 
direction. The result in Fig. 5.17 is a good approximation for finite plates if their 
length is much larger than the separation d. 

In our previous calculations involving parallel plate capacitors we neglected 
the edge effects. Here, however, we are calculating the edge effects! In the region 
deep between the plates (x —d) the electric field is approximately constant 
— j Vo/d. Near the end (x = 0) however, the electric field bulges out into the 
unbounded region, as illustrated in Fig. 5.17. The field at the edge is called the 
fringing field. Figure 5.17 illustrates the directions of the fringing field but not the 
magnitudes. The field on the midplane, y = d/2, is 

E y = -9V/9y = +dh/dx. (5.133) 

It can be shown 22 that E y (x,d/ 2) is proportional to 1 /x as x —*■ oo. 

The mathematics of conjugate functions is highly developed, both for its prac¬ 
tical applications to 2D potential problems and for its mathematical beauty. Con¬ 
formal maps such as the Schwarz-Christoffel transformations solve a great variety 
of interesting problems [2]. 


Examples 7-9 have presented, without much motivation, the functions that 
solve Laplace’s equation for the appropriate 2D boundary conditions. The reader 
may well object that the solutions in Examples 7-9 were “pulled out of a hat like 
a magician’s rabbit” rather than constructed logically. Of course the uniqueness 
theorem guarantees that the functions we have written are the correct, unique 
solutions. But where did they come from? 

There are formal techniques for finding the F(z) whose real or imaginary part 
gives the potential for a given 2D problem. These techniques have a certain kind 
of beauty, but they are beyond the scope of this book. 23 One trick is to construct a 

22 See Exercise 39. 

23 A famous discussion is Chapter IV of Smythe, Ref. 2. 
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conformal transformation. The idea is to start with a 2D problem whose solution 
is known and then transform it, using conformal mapping, into the arrangement 
of conducting boundaries in the problem to be solved. Dictionaries of conformal 
maps can be consulted. 24 For example, one 2D problem whose solution is well 
known is the potential of an infinite conducting plane. If one folds the plane to 
an angle of 90°, the new boundary is that of Example 7. The configuration of 
Example 8 results from folding the plane back on itself, i.e., by 180°. 

Although formal methods for constructing solutions exist, perhaps it is worth 
repeating that in physics many results owe more to inspired guess-work than to 
routine calculation. 


5.5 ■ ITERATIVE RELAXATION: A NUMERICAL METHOD 

The use of computers is now ubiquitous in physics, so our discussion of Laplace’s 
equation would not be complete if it did not include a numerical method. In fact 
there are many numerical techniques for solving boundary value problems on 
computers. We will consider one method, in which the potential is calculated ap¬ 
proximately on a lattice of discrete points by solving a finite difference approxi¬ 
mation of the partial differential equation. The method is not as powerful as other, 
more advanced methods, but it is simple and it illustrates the strengths and weak¬ 
nesses of numerical calculations. An interested student should consult Refs. 3 and 
4 to leam about other numerical methods. 

For simplicity we limit the calculations to Laplace’s equation in two dimen¬ 
sions, for V{x, y). The generalization to three dimensions is obvious. We start 
by setting up a square lattice network of discrete points (“sites”) and connecting 
line segments (“links”) that covers the region of interest where V 2 V — 0. The 
lattice is oriented with the x and y axes. The sites must be dense enough that the 
potential at any point (x, y) can be well approximated by interpolation between 
the values of V at nearby sites. Let e be the distance between neighboring sites, 
in either the x or y direction. Then the sites are labeled by two integers m and n, 
with the position of the site (m, n) at 

x mn =ime + j«e. (5.134) 

The limits on m and n depend on the boundary of the region of interest. For 
example, for a square with —a/2 < x < a/2 and —a/2 < y < a/2 the integers 
m and n would run from —N to N , where N — \a/e, as shown in Fig. 5.18. We 
denote by V mn the potential at x mn . 

The Laplacian V 2 is a differential operator. In the continuum it involves sec¬ 
ond derivatives, but on the lattice it will be approximated by a finite difference 
operation. The difference between values of V at points separated by e i is ap¬ 
proximately 

- 4 H. Kober, Dictionary of Conformal Representations (Dover, New York, 1952); V.I. Ivanov and 
M.K. Trubetskov, Handbook of Conformal Mapping (CRC Press, Boca Raton, FL, 1995). 
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FIGURE 5.18 A square lattice. In Example 10 the sides at x — ±a/2 are grounded 
('V = 0) and the sides at y = ±a/2 are held at potential Vo- 


V(x + ei)~ V(x)«e —. (5.135) 

ax 

The right-hand side is the first term in a Taylor series. In this way the first deriva¬ 
tive is related approximately to the finite difference. The second derivative may 
be approximated by a difference of differences, based on the approximate relation 

[V(x + ei) - V (x)] - [V(x) - V (x - ei)] « e 2 ~, (5.136) 

where the right-hand side is the first nonzero term in a Taylor series of the left- 
hand side. Taking x to be the position \ mn of site (m, n), Eq. (5.136) becomes 


Vm+l,n 2V mn + V m —\ n — € 
similarly, for differences in the y direction, 


,9 2 V 

ax 1 ’ 


2 9 2 V 

Vm,n+1 ~~ 2V m n + Vm.n -1 — e „ 2 ' 

dy 


(5.137) 


(5.138) 


Laplace’s equation is 9 2 V/9x 2 + 9 2 V/9y 2 = 0. If we substitute the finite differ¬ 
ence approximation for the second derivatives then the partial differential equation 
becomes the, finite difference equation 


V m +l.n T V m —l,n T Vm,n+ 1 T Vm.n — 1 — 4V mn . 


(5.139) 
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According to (5.139) the potential at x mn is equal to the average of the po¬ 
tential at the four nearest neighbor sites of the lattice. The method of iterative 
relaxation makes use of this property of Laplace’s equation to generate the solu¬ 
tion. Start with a set of initial estimates for the potential values V mn at the lattice 
sites x mn . Sites on the boundary must be assigned their specified values. Now 
move around the interior of the space, stopping at each lattice site exactly once; 
at each stop replace the current potential value V mn by the average of the values 
at the neighboring sites. Readjusting the values V mn in this way for all the interior 
sites is called one iteration. After many iterations the potential values converge to 
the solution of the difference equation (5.139). 

It is clear that V mn only approaches the solution in the limit of many iterations. 
The readjustment at one site ( m , n) brings it into temporary agreement with the 
neighboring sites. But each of the neighboring sites will be changed later, during 
either the same iteration or the next one, after which V mn will no longer agree 
with its neighbors. But the disagreement decreases as the number of iterations 
increases. When an iteration produces negligible changes in the potential values, 
then the process has converged to a satisfactory approximation of the solution to 
the difference equation (5.139). 

The process is called relaxation because it is an approach to equilibrium. In 
the continuum, the state of static equilibrium occurs when V 2 V = 0, with the 
boundary conditions satisfied. It can be shown 25 that the integral f(W) 2 d 3 x 
has its minimum value when V (x) satisfies Laplace’s equation, always keeping 
the boundary conditions satisfied. That is, equilibrium is the state with minimum 
electric field energy. Similarly, for the discrete lattice approximation the sum de¬ 
fined by 

S =J2 [(Vm+l.n - Vmn) 2 + (Vm.n+1 ~ V mn ) 2 J (5.140) 

m,n 

is minimum when V satisfies the difference equation (5.139). Each step in an 
iteration lowers the value of S, so during the iterative process the potential relaxes 
to the equilibrium state. 

The iteration generates a solution to the finite difference approximation of 
Laplace’s equation. How well that approximates the continuum potential depends 
on € (the lattice spacing) that must be small compared to any length scale over 
which the potential varies by a significant amount. The smaller e, the better the 
approximation; but also the longer the calculation, because each iteration must 
visit each site, and the number of sites is proportional to 1/e 2 (in two dimen¬ 
sions). Numerical calculations require compromises between accuracy and speed. 

The basic idea (5.139) that V(x) equals the average of V at nearby sites, in 
the limit of small e, can be extended to three dimensions. The analogous property 
in one dimension is particularly simple: The equation in one dimension (x) is 
d 2 V/dx 2 — 0, with general solution V — ax + b. This has the property that 
V(x) — 2 [V(x — 8) + V(x + <5)| for any <5. In words, V(x) equals the average 

25 


See Exercise 24. 
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of V at the endpoints of any interval centered at x. There is a similar theorem for 
three dimensions: In a charge-free region of space, V at any point P equals the 
average of V (x) over the surface of any sphere with P at the center. 26 


EXAMPLE 10 As an example of the relaxation method, we will calculate again 
the potential V (x, y) in the infinitely long hollow square conducting pipe shown 
in Fig. 5.1. We constructed an analytic solution for this problem in Example 1 (the 
infinite series in (5.31)) so we shall be able to check the accuracy of the numerical 
method. 

A lattice network for this problem is shown in Fig. 5.18. For this computer cal¬ 
culation the lattice spacing e is taken to be 0.01a (much finer than the schematic 
mesh shown in Fig. 5.18) where a is the side of the square. The coordinates x 
and y vary from —a/2 to a/2, so the domains of the site labels m and n are the 
integers from —50 to 50. The boundary values in terms of site labels are 

F-50,n — V+50,„ — 0 

VV-so = V m ,+ so = V 0 = 1, (5.141) 

where we use Vq as the unit of potential. 

Asa starting configuration we set V mn = 0.5 at all interior points. The graph 
on the right of Figure 5.19 is the quantity S of (5.140) as a function of iteration 
number, for 1000 iterations. As the V mn converge to the solution of (5.139), S 
decreases monotonically. The graph shows that it takes at least 1000 iterations to 
converge to a good approximation of the solution. We could follow other vari¬ 
ables to double-check the convergence. Figure 5.20 shows V m ,o as a function of 
m and Vo,n as a function of n after 1000 iterations. These are the discrete approx- 
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FIGURE 5.19 Convergence of the relaxation for Example 10. The graph on the left 
shows the variation of V (0, 0.25) as the iteration proceeds for 1000 relaxation steps; the 
graph on the right shows the quantity S of (5.140), which decreases monotonically as the 
iteration proceeds. 


26 


See Exercise 25. 
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FIGURE 5.20 Potential in the square pipe calculated by the method of iterative re¬ 
laxation. The potential values at every other lattice site (n even) along (a) the x axis, and 
(b) the y axis, are plotted. 


imations of the potential on the x and y axes. Note that the lattice approximation 
agrees accurately with the saddle shape shown in Fig. 5.5, the result of the analytic 
calculation. 

In some ways the numerical solution is superior to the analytic solution. The 
latter is still an infinite series, and it is nontrivial to evaluate V{x,y) from this 
series, especially if x or y approaches the boundary. In a real application of elec¬ 
trostatics, having an array of numerical values of V (x) on a dense set of points 
in the space, between which V may be estimated by interpolation, may be more 
useful than having a slowly convergent infinite series. 


EXAMPLE 11 The real power of numerical methods is that, unlike analytic meth¬ 
ods, they are not limited, practically, to highly symmetric systems. For example, 
we now relax the system shown in Fig. 5.21, the cross section of a conducting 
pipe crimped at one comer. Again we’ll set V = 0 on the walls at x = ±a/2, 
and V = Vq = 1 on the other boundaries. Although it may be possible to solve 
Laplace’s equation analytically with these boundary conditions, it would be very 
complicated. The numerical calculation is almost as simple as the earlier symmet¬ 
ric example. The only change is a slight modification in defining the boundary 
points around the crimp. 

Figure 5.22 shows the potential values V mi o and Vo,„ where again m and n 
run from —50 to 50, after 1000 iterations. Having seen that the relaxation method 
performs well in the symmetric case, where we can compare it to analytic results, 
we may be confident that the relaxation method is accurate in the asymmetric case 
for which we have no analytic solution. 
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y 



FIGURE 5.21 An asymmetric boundary. In Example 11 the sides at x = ±a/2 are 
grounded (F = 0) and the other boundaries are held at potential Vq- The top boundary 
consists of three parts: y = 0.5a for —0.5 a < x < 0.2a, 0.3a < y < 0.5a for x — 0.2a, 
and y = 0.3a for 0.2a < x < 0.5a. 


v/v 0 



-6.4 -0.2 0.2 0.4 


x/a or y/a 

FIGURE 5.22 Potential in the asymmetric pipe calculated by the method of iterative 
relaxation. The potential values at every other lattice site (n even) along (a) the x axis, and 
(b) the y axis, are plotted. 
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Many subtle and difficult issues are associated with numerical methods. For 
example, any numerical method is an approximation, so it is important to estimate 
the error of the approximation. Despite such problems, the computer has replaced 
the classic analytic techniques as the best way to calculate complex fields for real 
applications encountered in engineering and experimental physics. This does not 
mean, though, that analytic methods are obsolete. Any computer code must be 
verified for correctness and accuracy. The best verification is to run the program 
for examples with known solutions, and obtain the right values. Computers extend 
our ability to do calculations, but that only increases the importance of analytic 
results. 
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EXERCISES 

Sec. 5.1. Separation of Variables for Cartesian Coordinates 

5.1. (a) What choice of the parameters in (5.13) corresponds to the uniform field E = 

£ 0 k? 

(b) Describe the equipotentials for (5.13) if a — c and / are nonzero, and all other 
parameters are 0. 

5.2. Prove the orthogonality relations in (5.24), which are important in Fourier analysis. 

5.3. For the long square conducting pipe in Example 1, suppose that the potential on the 
sides at x — ±a /2 is 0, and the potential on the sides at y = ±a /2 is To cos(.Tx/a). 
Then find V(x, y) in the pipe, and sketch a graph of the equipotential curves in the 
xy plane. 

5.4. Calculate the integral in (5.40) if z) is a constant Vq, and verify thatthe result 
is (5.41). 
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5.5. In Example 2 we found that the potential inside the grounded conducting pipe in 
Fig. 5.6, with the end at x = 0 held at constant potential Vq> is 


V(.x.y.z) 


I6V 0 
n 2 


E E 

n odd m odd 


£ KnmX. 

nm 


sin 



(5.142) 


(a) Show that the Fourier series of a square wave f (y ) that has wavelength 2ly and 
amplitude 1 , with / = 1 for 0 < y < £2 and / = — 1 for —£2 < y < 0 , is 


fiy) = - E ~" sin 

71 n odd " 



(Hint: This square wave is like the one in Fig. 5.2, but with a phase shift of 90°.) 

(b) In particular, when the function in (a) is evaluated at y = £ 2/2 (in the region 
where / = 1 ) the result is one of the most remarkable infinite series in mathe¬ 
matics 



This series, discovered by Leibniz in 1673, shows that n is actually a very simple 
number, because it can be expressed using every odd integer exactly once. Prove 
Leibniz’s series from the Taylor series of arctan(x). 

(c) Use a computer program to plot the series /( y) truncated at n = 1 and at n = 
11, as a function of y with £2 = 1. (Superimpose the two curves, and show at 
least one full wavelength.) 

(d) Use a computer program to plot V(x, j, j) as a function of x, for £2 = £3 = 1 . 
The result shows that the electric field does not penetrate far into the pipe. 

5.6. Sketch the electric field lines in a semi-infinite rectangular pipe (as in Fig. 5.6) if the 
end plate has constant potential Vq and the walls are grounded. Also, describe the 
surface charge density. 

5.7. Show that if in Example 1 the three sides of the square x = ±a/2 and y = —a/2 
are grounded and the fourth side y = +a/2 is held at potential Vq, then inside the 
square the potential is 


V(x,y) = 


4 Vo 


vA (—i) n r 71 x 1 

£„STT c “[ <2 " + 1 ) v] 


n =0 


sinh [( 2 n + l)^(y/a + 1 / 2 )] 
sinh [( 2 n + l) 7 r] 


(Hint: In this configuration the potential is neither even nor odd in the variable y, so 
both sinh ky and cosh ky appear in the expression for V (x, y).) 

5.8. (a) From the result of the previous exercise and the superposition principle, show 
how to find V(x, y) if the sides of the square in Example 1 are held at arbitrary 
constant potentials, e.g., Vj at x = a/2, V 2 at y = a/2, V 3 at x = —a/2, and 0 
at y = —a /2 . 

(b) What is the potential at the originforthis case? [Answer: V (0, 0) = (Vi + V 2 + 
V 3 V 4 , which is just the average of the potential on the four sides of the square.] 
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Sec. 5.2. Separation of Variables for Spherical Coordinates 

5.9. Two concentric spherical conducting shells of radii R\ and Rj have potentials V\ 

and Vj. respectively. Find V (r) between the shells and verify V 2 V = 0 explicitly. 

5.10. On Legendre polynomials. 

(a) Use the recursion relation (5.52), and the normalization condition P( (1) = 1, to 
determine Pq(u ), P\(u), P 2 W, Pi(u). For the even polynomials, start with C'q 
and use (5.52) to calculate Cj, C 4 , C'g ... until the sequence terminates. Then 
determine Cq such that P^d) = 1. For the odd polynomials start with C 1 . 

(b) Use a mathematical software package to look up Pio(u) and P\ 1 (u). 

(c) From Legendre’s equation prove the orthogonality relation 



(d) Derive Eq. (5.62) from Legendre’s equation. 

5.11. (a) If an object in an applied electric field E app acquires an induced dipole moment 

p = crEapp, then a is called the polarizability of the object. Determine the 
polarizability of a conducting sphere of radius a. 

(b) Determine the surface charge density a (9) of the sphere, for E app = £fjk. 

(c) Verify that p = f r dq, which is the general definition of the dipole moment of 
a distribution of charge. (Here dq = ad A.) 

5.12. Derive the Taylor series ini for the function g(u, s) = (1 — 2us + .v 2 ) -1 / 2 through 
order j 3 . Show that the coefficients are the first four Legendre polynomials. 

5.13. Express the potential of a pointlike linear quadrupole, oriented parallel to the z axis, 
in Legendre polynomials. There are three charges: +q, —2q, and +q on the z axis, 
with —2 q at the origin, and +q at z = —a and +a. 


Sec. 5.3. Separation of Variables for Cylindrical Coordinates 

5.14. Two concentric long circular cylinders of radii £[ and R 2 have potentials Vj and 
V 2 , respectively. Find V(r) between the cylinders and verify V 2 V = 0 explicitly. 

5.15. What is the potential V ( r , <p) due to a cylinder of radius R. for which the upper-half 
cylinder 0 < <p < 71 is a conductor at potential + Vq, and the lower-half cylinder 
jt < <p < 2tz is a conductor at potential — Vq? The two half cylinders are separated 
by small insulating gaps at <p = 0 and <p = it. 

[Answer: 


E ex t (r, <p) = 



sin k<p. 


k odd 


5.16. What is the potential V ( r , <p) due to a cylindrical surface of radius R on which there 
is a surface charge density a(<p) = <tq cos <p r ! [Answer: 
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Vext(r, <t>) = 


CTqR 2 COS(J 

2e 0 r 


and Vint(r, </>) = 


a^r cos <p 
'"260 ’ 


5.17. What is the potential V (r, </>) due to a cylindrical surface of radius R, on which there 
is a surface charge density a (<p) — +ctq on the right half (in/2 < ip < 2 tt and 
0 < 4> < tr/2), and a (<p) = —ctq on the left half (n /2 < <p < in/2)l 

[Answer: 


V e xt(A </>) = 


Vint (A </>) = 


2a ()R 


E 


c-iy /k \ 22+1 


(2 j + l) 2 


(?) 


cos (2_/ + !)</>, 


2apR 

e 0 n 


E 

>=0 


(—1) J 


(2y +1) 2 (r) 


cos(2 j + 1)0.] 


Sec. 5.4. Conjugate Functions 

5.18. Show that the Cauchy-Riemann equations imply V 2 g = 0 and V~h = 0 in two 
dimensions. 

5.19. Sketch, qualitatively, the equipotentials and electric field lines of two perpendicular 
charged conducting plates, similar to Fig. 5.15 but with finite extent in x and y. 
Include curves far from the plates. 

5.20. Let F(z) = Cz be the complex function in the method of conjugate functions, where 
C is a real constant. What boundary value problem is solved if Re F is taken to be 
the potential? What boundary value problem is solved if Im F is taken to be the 
potential? 

5.21. Consider the function F(z) = iC In z, where z = x + iy and C is a real constant. 

(a) Let Im F be the potential function. What are the equipotentials and electric field 
lines? What is the charge distribution? 

(b) What boundary value problem is solved if Re F is taken to be the potential? 

5.22. In Example 7 we considered plates that intersect at a right angle. Find the potential 
V (x, y) in the wedge-shaped region between the plates if they intersect at angle 
a < 7t/2. (Hint: Try the complex function F(z) = Az p + i<t> 0 and determine the 
appropriate exponent p.) 

5.23. For the conformal map w —*■ z in (5.129), a curve in the complex w plane is mapped 
into a curve in the complex z plane; the curve in z is called the image of the curve in 

w. 

(a) What is the image of the upper half of the unit circle of the w plane? 

(b) What is the image of the upper side of the positive real w axis? What is the 
image of the lower side of the positive real w axis? 

(c) What is the image of the negative real w axis? [Answer: The real w axis is 
mapped into the conductors of the semi-infinite parallel-plate capacitor of Fig. 
5.17.] 
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General Exercises 

5.24. Consider an electrostatic system that consists of a set of conductors C\ , C2, C3 ,... 
held at specified potentials Vj, Vj, V3 ,.... Prove that the electrostatic field is — V4>, 
where <J>(x) is the function that minimizes U = ^ Jy (V<b) 2 d^x, subject to the 
boundary conditions <t> = V,- on C,-. The volume of integration V is the free space— 
the space outside the conductors. Thus electrostatic equilibrium is the state with 
minimum field energy satisfying the boundary conditions. 

5.25. Let P be an arbitrary point in a charge-free region of an electrostatic system, and set 
up a coordinate system with P at the origin. The average potential on a sphere 5 of 
radius r around P is 

V(r) = - z (f> V(r, 6, (p)r 2 sin 6d9d<p. 

4rrr z Js 

(Note, r is constant on S.) 

(a) Prove that d V/dr = 0, assuming there is no charge enclosed by S. 

(b) Prove that V is equal to the potential at P. 

5.26. From the result of the previous exercise, prove that V(x) cannot be a maximum 
or a minimum at a charge-free point. Prove that a point charge cannot be at stable 
equilibrium with electrostatic forces only (Eamshaw’s theorem). 

5.27. Consider again the long square pipe shown in Fig. 5.1 with boundary potentials 0 at 
x = ±a/2 and Vo at y = dza/2. 

(a) Show that the transformation (x, y) —► (y, x), i.e., reflection about the diago¬ 
nal y = x, produces an equivalent system. Show from this symmetry that the 
potential is constant on the diagonals of the square cross section. (Hint: Sketch 
qualitatively how the equipotentials must look.) Prove that this constant poten¬ 
tial must be Vq/2. 

(b) Verify numerically that V (0, 0) = Vo/2, by evaluating the series (5.31) by com¬ 
puter. Note, too, that we uncover from this exercise the beautiful and mysterious 
identity 

1 y (- 1 )" 1 

8 2n + 1 cosh [(2 n + 1 )tt/ 2] 

5.28. Consider the long conducting pipe shown in cross section in Fig. 5.1. But in this 
exercise let the walls at x = ±a/2 be grounded (V = 0) and the walls at y = ±a/2 
be held at equal but opposite potentials ±Vq. 

(a) Determine V(x, y). (Hint: Exploit the idea that it is now an odd function of y.) 

(b) Use a computer to plot the potential V (0, y) versus y. (Let Vo be the unit of po¬ 
tential and a the unit of length.) A computer program is useful for this purpose, 
because the result of (a) is an infinite series which must be evaluated numeri¬ 
cally. 

(c) Calculate the electric field at the origin. 

5.29. Solve Laplace’s equation in one dimension (x) in the bounded domain x\ < x < xj 
for boundary values V(xj) = Vi and V(x 2 ) = V 2 . 
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5.30. Consider a cube made of 6 conducting plates of size a x a, that encloses the space 
—a/2 < x, y, z < a/2. The plates at z = ±a/2 are held at potentials ± Vo, respec¬ 
tively. The other sides are grounded (V = 0). The separable solutions of Laplace’s 
equation inside the cube have the form 


{cosh(/c„ m z) or sinh(/c nm z)}cos[( 2 n + l)jr*/a] cos [( 2 m + \)ny/a] 


where n and m are non-negative integers. 

(a) Solve the boundary value problem for V ( x , y, z) inside the cube. 

(b) The potential is obviously 0 at the center, but what is the electric field there? 
Express E z = — 3 V/3z|(o,o,0) as an infinite series, and use a computer to find 
the numerical value in terms of Vq /a. 

5.31. Imagine a conductor consisting of three semi-infinite plates that intersect at right 
angles, making a comer like the floor and two walls of a room. Let the plates be the 
++ quadrants of the xy, yz, zx planes. 

(a) Show that the potential in the +++ octant is V (x) = C\xyz + C 2 . The constant 
C 1 depends on the charge on the plates. 

(b) Describe the surface charge densities on the boundary planes. 

5.32. Suppose a grounded spherical conducting shell of radius R surrounds a pointlike 
dipole at the center with p = pk. Find the potential V(r, 9) forr < R. (Hint: Use 
spherical harmonics regular at r = 0 to satisfy the boundary condition.) 


5.33. (a) Suppose on the surface of a sphere of radius R there is a surface charge density 
<7 (9) = cto cos 2 9, where cto is a constant. What is V (r, 9) inside and outside the 
sphere? 

(b) Suppose now that a (9) = cto sin 2 9. What is V(r, 9) inside and outside the 
sphere in this case? 

[Answer to (b): 


Vint(L 9 ) 


2 Ra 0 

3 eo 


V ex t(>\ 9) = 


2 Ro'o 

3 e 0 


2 R<7 o / r \ 2 

P2(C ° S0) 

2 Rv 0 (R\ 3 

P2 < COS ^ 


5.34. Consider a long circular cylinder of radius R bisected by a plane parallel to the axis 
of the cylinder, with the two halves insulated from one another. (Let the z axis be the 
central axis of the cylinder, and the bisecting plane the xz plane.) One hemicylinder 
(0 < 0 < 7 r) is grounded and the other (it < <f> < 2n) is held at potential Vo- 

(a) Use the method of separation of variables to determine the potential V(r, </>). 

(b) Evaluate the potential at r = 0. 

(c) Evaluate the electric field at r = 0. 

(d) Use a computer to plot V(R/2, 4>) as a function of </>. (It is most convenient to 
use a mathematical software package to calculate numerically the infinite series 
and plot the results.) 

5.35. Suppose V(x) satisfies Laplace’s equation in three dimensions. Show that the po¬ 
tential at x is approximately equal to the average of the potentials at the six points 
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x±ei,x±ej,x±ek, where e is small, and that the error of the approximation is 
of order e 3 . 

5.36. The real part of F(z) = (—X/2neo)lnz, where z — x + iy, is the potential function 
of a two-dimensional electrostatic system. What is the charge distribution? Sketch 
the equipotentials and electric field lines. 

5.37. An array of long charged wires, parallel to the z axis and passing through the x 
axis at x = 0, ±f , ±2 l, ±3f ,..., is part of the design of a multiwire proportional 
counter (MWPC), invented by Charpak (Nobel prize, 1994). Neglecting end effects, 
this is a two-dimensional system. 

(a) Show that the potential V ( x , y) is the real part of 

F(z) = (-A/2jreo)ln[sin(;rz/f)] 

where z = x + iy. (Hints: We know that Re F satisfies Laplace’s equation, so 
we only need to verify the boundary conditions at the wires. Show that Re F is 
a periodic function of x with period l. What is V{x,y) near the wire at ( x , y) = 
( 0 , 0 )?) 

(b) Show that 

X [ (2ny\ (2nx\~ 

l~J" co T“r].r 

(c) Sketch the equipotential lines and electric field lines, or, better, use computer 
graphics to make a contour plot of V( x , y). 

(d) Show that in the limit y —>■ oo the electric field approaches the field of a charged 
plane with charge per unit area X/i. Does this result make sense? 

5.38. The sentence at the head of this chapter is part of a longer and rather serious quote 
from the essay: “The Unreasonable Effectiveness of Mathematics in the Natural Sci¬ 
ences,” by E. R Wigner. It is in the book Symmetries and Reflections (Indiana Univ. 
Press, Bloomington, 1967, W. J. Moore and M. Scriven, eds.) 

“The miracle of the appropriateness of the language of mathematics for 
the formulation of the laws of physics is a wonderful gift which we nei¬ 
ther understand nor deserve. We should be grateful for it and hope that 
it will remain valid in future research and that it will extend, for better 
or for worse, to our pleasure, even though perhaps to our bafflement, to 
wide branches of learning.” 

Now that you’ve had experience solving many mathematical exercises, what do you 
think of this? 


Computer Exercises 

5.39. What is the electric field on the midplane of the plates of a semi-infinite parallel plate 
capacitor, both inside and outside the capacitor? The plates, shown in Fig. 5.17, are 
half-planes at y = 0 and y = d with x < 0. The midplane is y = d/2. The problem 
is to calculate E y (x, d/2). In the notation of Example 9, 
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(f). 


E ,(x.y) = - — = + 


(dih 

Va* 


).• 


the second equality by the Cauchy-Riemann relations. In terms of the parameters p 
and 6 of Example 9, the midplane has 6 = n /2, and 


3 h _ (dh/dp)Q =x /2 
dx (dx/dp) e=7t /2 ' 


(a) Compute E y (x, d/2) in units of Vo/d at x — —2d, —d, 0, d. (Neglecting edge 
effects E y would be — Vo/d between the plates.) 

(b) Use computer graphics to plot E y (x, d/2) in units of Vo/d as a function oix/d. 
(Hint: Use parametric plot.) 

(c) For what x on the midplane is the field strength 0.1 Vo/d? 


5.40. Consider the system of Example 10, a long pipe with a square cross section of di¬ 
mensions a x a. Let the z axis be the central axis of the pipe, and suppose the sides at 
x — ±a/2 are grounded (V = 0) while the sides at y — ±a/2 are held at potential 
Vo- Write a computer program to solve Laplace’s equation for V(x, y) by iterative 
relaxation. Make plots of V (jc, 0) versus x and V (0, y) versus y. 

5.41. Modify the program in Exercise 40 to solve Laplace’s equation for V(x, y) with 
these boundary conditions: 


V(±a/ 2, y) = 0 

V(x, +a/2) = 0 and V(x, -a/2) = V 0 ; 


that is, three sides are grounded, and the bottom side is at Vq. Make plots of V(x, 0) 
versus x and V(0, y) versus y. Show that V(0, 0) = 0.25 Vo- 

5.42. By iterative relaxation solve Laplace’s equation in a square (—a/2 < x < a/2 and 
—a/2 < y < a/2) with these boundary conditions: 


V(±a/ 2, y) = V(x, ±a/2) = 0 

V(x, a/10) — —Vo for -a/4 <x <a/4 

V(x, — a/10) = +Vo for — a/4 < x < a/4. 


That is, the four sides of the square are grounded (V = 0); and in the square there 
are parallel line segments (i.e., plates extending in the z direction) at y = ±0.1a 
with x from —0.25a to 0.25a held at potentials +Vq. 

(a) Plot E y (x, 0) versus x. Like a capacitor, the electric field is approximately uni¬ 
form between the line segments and decreases outside. In the lattice approxima¬ 
tion E y is computed from a finite difference 


Ey(x, 0) = - 


V (x, +€) - V(x, -e) 
2e 


where e is the lattice spacing. 

(b) The energy density u(x, y) is proportional to E^(x, y) + Ey(x, y). Calcu¬ 
late E%(x,y) + Ey(x,y) for representative points in the square: e.g., (0,0), 
(0.25a, 0), (0.4a, 0), and (0, 0.3a). 



CHAPTER 



Electrostatics and Dielectrics 


Up to now we have studied electrostatic systems of charges and conductors. Now 
we are ready to include insulators, also called dielectrics, in the systems we study. 

Faraday coined the word “dielectric” to describe the effects he had discovered 
when an insulator is put into a capacitor. 1 He found that when a capacitor is filled 
with an insulating material—a dielectric—more charge and energy are stored at 
a given potential than without the dielectric. The capacitance of the system, the 
energy stored, and the charge on the conductors, are all greater by a factor k, 
which is a characteristic property of the dielectric material called its dielectric 
constant. There is a wide range of k among dielectrics. Capacitor design is an 
important topic in applied physics, and the properties of dielectrics are crucial, 
because capacitors are key elements in circuits. 

In a metal, the atoms release one or more outer electrons to form a sea of elec¬ 
trons that are free to move throughout the conductor. When an external electric 
field is applied, as we have learned, free electrons become distributed on the sur¬ 
face in such a way that in electrostatic equilibrium there is zero field and zero net 
charge density inside the conductor. 

In a dielectric, by contrast, the electrons are not free, but rather bound to their 
atoms or molecules. When an external electric field is applied to a dielectric, the 
electrons and nuclei become displaced by small distances, in the direction of E in 
the case of the nuclei, or in the opposite direction in the case of the electrons. But 
the electrons cannot escape from the Coulomb forces that bind them to the asso¬ 
ciated nuclei. In an atomic dielectric, solid argon for example, the electrons are 
bound to atoms. In a molecular dielectric, solid CO 2 for example, the electrons are 
bound to molecules. The opposite displacements of positive and negative charges 
result in a polarized atom or molecule, with a nonzero dipole moment. In cases 
such as these, the dipole is called an induced dipole. Figure 6.1 illustrates an in¬ 
duced atomic dipole. We also say that the bulk dielectric, made up of polarized 
atoms or molecules, has become polarized, because it has acquired a net dipole 
moment. The bulk material may also have acquired a net charge density inside, 
as we will see. So unlike a conductor, a dielectric may have a nonzero field and 
charge density inside. 

Some dielectrics are composed of molecules whose electronic structure is 
asymmetric in space, so that the molecule has a permanent dipole moment, even 
in the absence of an external field. Water (H 2 O), nitric oxide (NO), and car- 

1 Recall that the SI unit of capacitance, the farad (F), is named for Faraday because of his discoveries. 
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FIGURE 6.1 An induced atomic dipole. The atom on the left is unpolarized. When a 
field is applied, the nucleus is pulled in the direction of E and the electrons in the opposite 
direction, so the atom has a dipole moment proportional to E. 


bon monoxide (CO) are examples. These are called polar molecules. In such 
dielectrics there is an additional mechanism of polarization, besides displacement 
of charges within the molecule, due to orientation of the permanent moment, 
which contributes to the polarization of the bulk material. 

Because these polarization processes involve bound electrons, dielectric prop¬ 
erties inevitably depend on the details of atomic and molecular structure. There¬ 
fore dielectric properties vary widely. Of course the properties of conductors also 
depend to some extent on atomic and molecular interactions, as we can see by 
comparing, say, lithium to lead. But the behavior of different metals in a static 
electric field is more or less the same. The behavior of different dielectrics can be 
quite different. Our presentation of this subject begins with a discussion of atoms 
and molecules as electric dipoles. That will prepare us to explore the connection 
between the dielectric properties of bulk matter and their microscopic origin. 


6.1 ■ THE ATOM AS AN ELECTRIC DIPOLE 

We can explain the magnitude of dielectric effects based on simple models of the 
atomic or molecular dipole moments. 

6.1.1 ■ Induced Dipoles 

In equilibrium, an isolated atom in an electric field E has a dipole moment p 
proportional to the field. 


p = aE, (6.1) 

because the field pulls the nucleus and the electrons in opposite directions, as 
illustrated in Fig. 6.1. The constant of proportionality a is called the atomic po¬ 
larizability. Its units are rather clumsy, namely, Cm 2 /V. Atomic polarizabilities 
must ultimately be measured by experiment, but we can estimate their order of 
magnitude by modeling the atom as a conducting sphere whose radius R is a 
typical atomic size, say R ~ 10~ 10 m. We recall from Chapter 4 that the in¬ 
duced dipole moment of a conducting sphere is p = 47reo/? 3 E, which is pro- 
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TABLE 6.1 Atomic polarizabilities 



a/(47reo) in units of 10 

~ 3 0 m 3 

H 

0.667 

He 

0.205 

Li 

24.3 

Ne 

0.396 

Na 

23.6 

Ar 

1.64 

K 

43.4 

Kr 

2.48 

Rb 

47.3 

Xe 

4.04 

Cs 

59.6 




portional to the sphere’s volume. The model polarizability is a = 4neoR i , so 
a/(4jreol = 10 _30 m 3 for a typical atom. It is customary to give polarizabilities 
in the form a/(4rreol because this quantity has simply the units of volume. 

Table 6.1 lists the polarizabilities of a few simple atoms. Note that in all cases 
a/(4 tc€q) is of order 10 _30 m 3 , in accord with the above model, but there is 
marked variation. Atoms of the alkali metals, in the left column of Table 6.1, 
have large polarizabilities because they have a loosely bound outer electron. The 
values of a for Li, Na, K, Rb, and Cs were determined by measuring the deflec¬ 
tions of atomic beams in a nonuniform electric field. Atoms of the noble gases, 
in the right column of Table 6.1, have small polarizabilities because their electron 
shells are all filled. The values of a for He, Ne, Ar, Kr, and Xe were determined 
by measuring the dielectric constant in a capacitor filled with the element. Of all 
the elements, He has the smallest polarizability. The largest atomic polarizability 
that has been measured is for Cs, but presumably that of Fr is still larger. As Ta¬ 
ble 6.1 shows, polarizabilities within each column do increase with atomic size. 
However, they are not simply proportional to R 3 , as the naive model predicts. 

A precise theory of atomic polarizability requires quantum mechanics. The 
simplest case is the hydrogen atom. In bare outline the procedure for calculating 
a of atomic hydrogen is as follows. The electron wavefunction xp(x) obeys the 
Schrodinger equation 

H 2 e 2 

VV-- - ir+eE 0 z^ = £f (6.2) 

2m 4neor 

where £ is the electron energy and L’ok is the external electric field. The equa¬ 
tion may be solved for small Eo by perturbation theory [1,2]. The electric dipole 
moment is classically —ex, because x is the vector from the proton to the elec¬ 
tron. The mean dipole moment in quantum mechanics is the expectation value 
p = {ip- 1 — ex\xp). From the solution of the Schrodinger equation, p turns out to 
be 

p = !(47reo<4)£ok ( 6 - 3 ) 

where ag is the Bohr radius. Comparing to (6.1) the quantum theory of the hy¬ 
drogen atom gives 
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= -al = 0.667 x 10“ 30 m 3 . (6.4) 

4 ne 0 2 

This calculated value is used for H in Table 6.1. The calculation for larger atoms 
is more difficult, but the results of quantum theory agree with the measurements 
in Table 6.1. 


6.1.2 ■ Polar Molecules 


Some molecules have a permanent dipole moment. For example, the electron den¬ 
sity in an HC1 molecule is not symmetric about the center, but rather skewed to¬ 
ward the Cl nucleus, so that there is a permanent dipole moment (in the body-fixed 
frame of reference) pointing from Cl to H. Another example is the water molecule 
H 2 O, which is an asymmetric molecule with an electric dipole moment pointing 
from O toward the midpoint of the line connecting the H’s. 

Dipole moments of polar molecules are expressed in units of debyes (D) where 
1 D = 3.34 x 10 ~ 30 C m. Some typical dipole moments of small polar molecules 
are 1.75 D for HF, 1.04D for HC1, 0.80 D for HBr, 0.83 D for HI, 1.83 D for 
H 2 O, and 1.48D for the triangular pyramid molecule NH 3 . For comparison, eag 
is 2.5 D. Large molecules can have large dipole moments; that of hemoglobin, for 
example, is hundreds of debyes. 

If a material is composed of polar molecules, then the permanent molecu¬ 
lar moments create a dielectric effect whose magnitude depends on tempera¬ 
ture. When a polar molecule is placed in an electric field E = Eok, there is a 
torque p x E twisting the dipole toward alignment with E; the potential energy 
is U = — p • E. We may estimate the thermal average (p) of the dipole moment, 
as a function of the temperature T, by treating the molecule as a classical ro¬ 
tor and applying statistical mechanics. The dipole orientations have a Boltzmann 
distribution, and the mean dipole moment is 


<P) = 


f e u ! kT p dSl p 
f e~ u ! kT d£l p 


(6.5) 


The dipole orientation can be specified by the polar angle 9 (the angle between p 
and E) and the azimuthal angle cp (the angle around the field line) as 


p = p sin 9 cos (p + j sin 0 sin</> + kcos 0 ^ . ( 6 . 6 ) 

Then the integration over the orientations of p is dQ. p = sin 9d9d<p. The integral 
over <p is zero for the i and j components, because the energy U = —pEo cos 9 
is independent of <p. So the mean dipole moment points in the k direction, (p) = 
(p z ) k, with 


( Pz) = 


p Jq e v ! kT cos 9 sinOdO 
Jq e~ u / kT sin 9dB 


P /_!] e au udu 
j e au du 


(6.7) 
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(p)/p 



pEJkT 


FIGURE 6.2 The Langevin formula. A molecule with a permanent dipole moment of 
magnitude p is placed in an electric field £()k. The thermal average of the dipole moment 
vector at temperature T is (p z ) k, and it depends on pEo/kT. 


In the second equality we have changed the variable of integration to u = cos 0 
and defined the dimensionless parameter a = pEo/kT. The integrals are elemen¬ 
tary, and the result is 


(Pz) = P 


-h coth a 

a 


( 6 . 8 ) 


This result, called Langevin’s formula, is plotted in Fig. 6.2. At low temperature 
(a » 1) the dipole aligns with E and so (p z ) approaches p. At high temperature 
(a <£ 1) the thermal fluctuations of the dipole orientation are large, so the mean 
value (p z ) approaches 0. More precisely, in the case pE o <<C kT we may approx¬ 
imate {p z ) by pa/ 3. (The Taylor series for coth a is 1/a + a/3 + a 3 /45 + ■■■■) 
Then the mean dipole moment is a linear function of E, equal to aE with 



(6.9) 


This a is the polarizability of a molecule with a permanent dipole moment p at 
temperature T. The temperature dependence a oc T~ ] is called a Curie law. In 
deriving (6.9) we ignored intermolecular interactions. Therefore the result holds 
only when interactions with other polar molecules are negligible, as in gases. 

A polar molecule in an electric field also has an induced dipole moment due 
to distortion of the electron wave function, which contributes a term a o to the 
polarizability. The polarizability from both effects is the sum 


a — ao + 


3 kT' 


( 6 . 10 ) 


The term ao from the induced moment is independent of T, so the two effects 
can be disentangled. Equation (6.10) is called the Langevin-Debye formula. De- 
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bye used experiments on dielectric effects to measure a, and thereby determine 
molecular size and structure. 

In Sec. 6.3 we will calculate the response of a macroscopic dielectric sample 
to an applied electric field. The input to the calculation is the polarizability a of a 
single atom or molecule. 


6.2 ■ POLARIZATION AND BOUND CHARGE 

To account for the effects of atomic polarizability, it is necessary to define a new 
field (in this case a kind of matter field) called the polarization P(x). The polar¬ 
ization is a vector function of position in the dielectric. P(x) is defined to be the 
mean dipole moment density. This requires some explanation. The polarization at 
x is the dipole moment per unit volume, averaged over a subvolume that is small 
compared to the macroscopic scale of the system but large compared to a single 
atom, 


j 8N 

P(x) = (6 - n) 
i=l 

In the equation, p ( denotes the dipole moment of the ; th atom in S V, and 8 N is 
the number of atoms. Because the number of atoms is very large, even in a small 
8 V, P(x) is a smooth function of position. The fluctuations of individual atomic 
dipoles are washed out by averaging over many atoms. 

The vector field P(x) is analogous to the scalar function p(x), the macroscopic 
charge density, which is similarly defined as an average over subvolumes contain¬ 
ing many atoms. In the case of p the quantity that is averaged is the charge. In the 
case of P it is the dipole moment. In fact, P(x) and p(x) are related, and we now 
derive the relation. 

Polarization is a shift of charge—positive charge being displaced down-field 
and negative charge up-field. In equilibrium without an applied field the mean 
charge density is 0 throughout the dielectric; i.e., the charge density due to elec¬ 
trons is equal but opposite to that due to the atomic nuclei. But when there is 
polarization, the displacement of charge may create a nonzero net charge density 
at some points in the material. The mean charge density resulting from polariza¬ 
tion is called the bound charge density, and denoted pb(x). We shall prove that 

p b (x) = —V • P. (6.12) 

This result is so important that it is worthwhile to study two different derivations 
of it. 

For the first derivation, recall the potential of an electric dipole p, 

p • (x — x') 

V(x) = 

47reo|x — x'| J 


(6.13) 
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where x' is the position of the dipole. The potential due to the superposition of all 
dipoles in the dielectric is 


f (x - x') • P(x')<* V 
J 4;reo|x-x'| 3 


(6.14) 


because P {x')d^x' is the net dipole moment from all atoms in d?x'. The inte¬ 
gration region is the whole dielectric volume. Now we shall rewrite (6.14) in a 
familiar form. First, note that (x — x')/|x — x'| 3 is equal to V'(l/|x — x'|), where 
V' is the gradient operator for x'. Next, convert the integrand by the identity 


, «-( v irhi)- v '-(ir£r) 


V'-P(x') 
lx — x'l • 


(6.15) 


Finally, apply Gauss’s divergence theorem to the volume integral of the first term 
on the right side of (6.15)—a total divergence—obtaining a surface integral over 
the boundary of the dielectric. The result then becomes 


V(x) = 


( P(x') • n dA' 
s 4tre 0 |x - x'| 


r V' •P(x , )^ 3 .x' 
J 4^e 0 |x — x'| 


(6.16) 


where S is the surface of the dielectric. These integrals have forms that we have 
met previously in electrostatics, as potentials due to surface charge and volume 
charge, so we write 


V (x) 


-l 


cr^dA 1 


S 4jre 0 |x - x' 


+ 


/ 


p b (x')d 3 x' 
47reo|x — x'| 


(6.17) 


The volume function Pb(x') is —V' • P(x'); the surface function crblx') is 


CT b = n P 


(6.18) 


where n is the outward unit normal vector at the point x' on S. The interpretation 
of (6.17) is that the surface charge density on S is CTb(x') and the volume charge 
density is Pb(x'), hence proving (6.12). 

The above derivation of (6.12) is rather formal, involving several mathematical 
steps whose physical meaning is obscure. Therefore, we should study another, 
more physical derivation of the result. Consider an arbitrary volume V entirely 
inside the dielectric. When the dielectric is unpolarized the total charge inside V is 
0. As the material becomes polarized, in response to an applied field, some charge 
will cross the surface S of V, either because atoms near S become distorted, 
or because polar molecules near S align with the field. Let Qacross be the net 
charge that moves across S outward from V as the material becomes polarized. 
By conservation of charge, the net charge in V after the dielectric is polarized is 
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FIGURE 6.3 An ideal dipole—charges ±q separated by distance £—straddling a surface 
element dA, with p normal to the surface. If the center of the dipole is within distance 1/2 
above or below dA, then the positive and negative charges are on opposite sides of d A. 
The volume d V occupied by the atoms that straddle the surface is d V — Id A. 


' Q across. SO 


i 


Pbd 3 x 


= -<2a 


(6.19) 


where pb(x) is the bound charge density of the polarized material. 

To calculate gacross, consider a small patch d A on S. If an atom is close enough 
to dA then it will straddle dA when polarized, as illustrated schematically in Fig. 
6.3, and so contribute to (Across- We’ll calculate d (Across, the charge crossing the 
small patch dA, by treating the atomic dipole as a pair of charges +q and —q 
with vector i from — q to q. The dipole moment is qi. If the center of a polarized 
atom lies within distance £„/2 of the patch dA, inside or outside V (see Fig. 6.3) 
then net charge q will cross dA in the outward direction; note that l„ = n • i is 
the component of i normal to dA. If the center of the atom is inside V (within 
distance £„/2 of dA) then +q crosses dA in the +n direction when the atom is 
polarized; if the center of the atom is outside V then — q crosses dA in the —n 
direction. In either case net charge q crosses dAin the outward direction. The 
total charge crossing dA is q times the number of atoms in the volume l n dA\ 
that is, d Qacross = q ( n a dV ), where n a is the atomic density. But ql n n a is just 
the normal component of the polarization field, so e/Qacross = P n dA. Integrating 
over the entire surface S yields 

Qacross — & P n dA = [ V • P d 3 x, (6.20) 

Js Jv 

the second equality by Gauss’s theorem. Comparing (6.19) and (6.20), which must 
hold for any volume V in the dielectric, we see that pb = —V • P, as claimed. 

If the polarization field is uniform, i.e., independent of x, then Pb(x) is 0. In 
this case the bound charge lies only on the surface of the dielectric, with density 
CTb = n • P. If the polarization varies within the dielectric, then there is nonzero 
bound charge at points where P(x) diverges, with density —V • P. Divergence of 
polarization, not merely polarization, produces bound charge. 

There is a simple graphical demonstration that uniform polarization produces 
a surface charge on the boundary. On a sheet of paper draw a rectangle and fill 
it with small blue dots, which represent the positive atomic nuclei. On a trans¬ 
parency draw an identical rectangle but fill it with small red dots, which represent 
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the negative electrons. First overlay the two rectangles, which together represent 
an unpolarized sample. The blue and red dots cancel throughout, so the charge 
density is 0. Now, to simulate uniform polarization, move the transparency a small 
distance. There will be a layer of blue dots, i.e., positive ob, on one side, and a 
layer of red dots, i.e., negative <7b, on the other side. In the volume, the blue and 
red still cancel, so p\> is 0. (We could demonstrate volume bound charge if the 
transparency were stretchable!) 

We have concentrated on the bound charge, a property of the polarized ma¬ 
terial. Now what about the field? The bound charge creates an electric field, so 
the dielectric becomes a field source. One way to calculate the field in the pres¬ 
ence of a dielectric is first to determine the bound charge densities, p\, and ob, 
and then apply the standard methods for finding the field due to a known charge 
distribution. 


EXAMPLE 1 Determine the bound charge density and the electric field of a uni¬ 
formly polarized sphere. 

Figure 6.4 shows the sphere of radius a. Let the z axis be in the direction of the 
polarization P. Then for r < a. P(x) is Pok where Po is a constant. The volume 
charge density fto is 0, because V • P = 0. The surface charge density on the 
spherical boundary r = a is ob(0) = ? • P = Po cos 6. 

Figure 6.4(a) shows why polarization produces surface charge in this example. 
For 0 <9 < 7 r /2 the surface density is positive, because there the positive ends of 
atomic dipoles are at the surface; for 7 t/2 < 6 < n the surface density is negative 
because the negative ends are at the surface. 

We may construct the potential function for the polarized sphere by the meth¬ 
ods of Chapters 4 and 5. The mathematical problem is identical to a spherical 
surface with charge density a(0) — Po cos 0. The interior potential is C\r cos 6, 
and the exterior potential is Cir~ 2 cos6, where C\ and C 2 are constants. These 
functions satisfy Laplace’s equation for r < a and r > a. The constants are de- 



FIGURE 6.4 Example 1. A uniformly polarized dielectric sphere, (a) The atomic 
dipoles inside the sphere produce a bound charge on the surface, with density a (9) = 
P cos0. (b) Electric field lines of the sphere. 
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termined by the boundary conditions at r = a. The potential must be continuous 
at r = a. 


Vint {a, 6) = Vext (a, 9), 


( 6 . 21 ) 


which implies Cj = Cia 3 . The normal component of E has a discontinuity at 
r = a proportional to the surface charge density 


Er,t\ t(0, 9) E r , int(fl, 0) 


ob (9) 
eo 


P 0 cos 9 
eo 


( 6 . 22 ) 


andthis implies C\ = Pq/2>€q. Thus the potential function for the polarized sphere 
is 


Vint(r, 9) 


Vext (r,9) 


-— cos 9 for r < a, 
3e 0 


Ppa 3 
3 e 0 r 2 


COS0 


forr > a. 


(6.23) 

(6.24) 


The electric field is —VV. Inside the sphere E is — Pok/(3eo), a uniform field 
in the —k direction. Outside the sphere E is a pure dipole field. The dipole poten¬ 
tial is p cos 6/(4jTeor 2 ), so the dipole moment of the sphere is 

Poa 3 « 4 

p = 4^e 0 ——k = -jza 3 P. (6.25) 

3e 0 3 

The result makes sense: The dipole moment is the moment density (P) times the 
volume of the sphere. Figure 6.4(b) shows the electric field lines for a uniformly 
polarized dielectric sphere. 


6.3 ■ THE DISPLACEMENT FIELD 

The fundamental equations for static electric fields are V x E = 0 and V ■ E = 
p/eo- These equations are true in vacuum or in a dielectric. In either case p must 
include all the charge sources. In a dielectric there may be charge density from 
bound charge in addition to any other charge that is present. The bound charge 
depends on the behavior of the atoms that compose the dielectric. To determine 
the field and charge, we must know how the particular dielectric material responds 
to an electric field. 

To analyze a dielectric system, it is useful to separate the charge density into 
bound charge Pb(x), and free charge pf(x), by which we mean the charge placed 
in the system from the outside; 

p(x) — P b (x) T p f (x). (6.26) 

The total charge density is equal to eoV • E, and the bound charge density is 
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—V • P by (6.12). So two of the terms in (6.26) are divergences, and we may 
combine these to write the equation as 

V • (e 0 E + P) = p f . (6.27) 

The combination eoE + P is a new vector function denoted D(x), called the dis¬ 
placement field. 

The basic equations for the displacement field are its definition 

D = e 0 E + P (6.28) 


and its field equation 


V • D = p f . (6.29) 

By definition (6.28), D consists of a field term plus a matter term. The field equa¬ 
tion (6.29) is another form of Gauss’s Law. The corresponding integral relation 
is 


(j) D • dA — (2f)enclosed • (6.30) 

The important point is that the field equations (6.29) or (6.30) relate D to the 
free charge density alone, i.e., without reference to the bound charge. In a highly 
symmetric system, for which the form of D(x) can be deduced from the symmetry 
alone, the “Gaussian surface trick” can be used to determine D(x) from the flux 
integral. 

Note an important difference between D(x) and E(x). The electric field is irro- 
tational, V x E = 0, so E may be written as the gradient of a scalar. But in general 
V x D is not 0. Also, there are different boundary conditions for E and D, which 
will be derived in Sec. 6.5. 

A general problem in electrostatics with dielectrics is to determine E(x) and 
P(x) for a given set of free charges or applied fields. We may imagine an exper¬ 
iment. The observer can set up charges, conducting plates, insulators, etc., and 
then measure the electric field throughout the space. However, the observer does 
not get to specify the polarization in a dielectric, because that is determined by 
atomic or molecular properties of the material. 

A common strategy for solving systems with dielectrics is to first find D for 
the specified free charge, using the symmetry and Gauss’s Law (6.30). But once 
D is known, we still need more information to calculate E or, equivalently, P. 

To predict the behavior of the electrostatic system we must know how the 
dielectrics respond to an applied field. In principle P(x) could be calculated by 
solving the Schrodinger equation for all the bound electrons in the system, but that 
would not be practical. Rather, we need an empirical connection between P(x) 
and E(x), i.e., one more equation to close the set of simultaneous equations. The 
equation that relates P and E, or equivalently D and E, is called the constitutive 
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equation. Here we will consider only the simplest case, in which the constitutive 
equation is an isotropic linear relationship. 

6.3.1 ■ Linear Dielectrics 

An insulating material for which P(x) is proportional to E(x), with a scalar con¬ 
stant of proportionality, is called an isotropic linear dielectric. All dilute gases 
are linear dielectrics, because the isolated molecules have dipole moments pro¬ 
portional to E. Many other materials, especially liquids and amorphous solids 
like glass and plastics, are accurately described as linear dielectrics. Crystals are 
different. While there may be a linear relation between P and E, in general it is a 
tensor equation because in a crystal P may have nonzero components perpendic¬ 
ular to E. 

Only one parameter is needed to relate P and E in an isotropic linear dielectric. 
However, three constants are conventionally used for the constitutive equation: the 
susceptibility x e , the permittivity e, and the dielectric constant k. They are defined 
in Table 6.2. These parameters are not independent; any one determines the other 
two. For example, from D = eoE + P and D = eE, we derive a relation between 
e and Xe, 


e = e 0 (l + Xc); (6.31) 

then by the definition of k 

k = l + Xe- (6.32) 

Note that Xe is always > 0, because the atomic dipoles align with E so that P 
points in the same direction as E. Therefore e > eo and k > 1 for any dielectric 
material. 

We can now complete our strategy for solving symmetric problems with di¬ 
electrics. First find D from the free charge. Then calculate E and P from the linear 
constitutive equations. Finally, p and pb can be determined from E and P. 

In vacuum, the susceptibility is obviously 0, so e = co and k = 1. In matter, 
these macroscopic parameters are ultimately determined by atomic properties, 
such as the atomic polarizability a and number density n. In a dilute gas the inter¬ 
actions between atoms are negligible, so each atom has dipole moment crE, and 
the polarization is P = naE; in that case, y e = na/e o and k = 1 + na/e o- The 


TABLE 6.2 Parameters for an isotropic linear dielectric 
with their definitions 


Parameter 

Symbol 

Defining equation 

susceptibility 

Xe 

P = / e e 0 E 

permittivity 

€ 

D = ( E 

dielectric constant 

K 

k = e/e 0 
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TABLE 6.3 Dielectric properties of insulators 


Material 

Dielectric constant 

K 

Dielectric strength 
£ max in 10 6 V/m 

air 

1.00059 

3 

polystyrene 

2.5 

20 

Lucite 

2.8 

20 

Plexiglas 

3.4 

40 

Teflon 

2.1 

60 

Mylar 

3.1 


paper 

3.7 

16 

fused quartz 

3.8 to 4.1 


Pyrex 

4 to 6 

14 

water 

80 


strontium titanate 

332 

8 


dielectric parameters of a gas are not very different from vacuum because n is 
small. For example, the dielectric constant of air is 1.00059. In condensed matter 
the relation between k and atomic parameters is more complicated, because each 
atom is influenced by nearby atoms. The dielectric parameters of condensed mat¬ 
ter may be very different from vacuum. For example, the dielectric constant of 
water is 80. Table 6.3 lists the dielectric constant for some insulators. 

Table 6.3 also lists the dielectric strength of the materials. The dielectric 
strength is the maximum electric field strength £ max for which the material re¬ 
mains an insulator. If the field strength exceeds E max , then electrons are pulled 
away from the nuclei of atoms or molecules and become free to move through the 
material, making it a conductor. The process when current begins to flow is called 
dielectric breakdown. It is familiar in air as a spark. The dielectric strength is an 
important parameter if the material is used in a capacitor, because it sets a limit 
on the maximum potential that the capacitor can maintain without breakdown. 

6.3.2 ■ The Clausius-Mossotti Formula 

We have seen that polarization of atoms or molecules by an electric field produces 
a macroscopic polarization field P(x), equal to / e eoE(x) for a linear material. The 
purpose of this section is to derive a relation between the microscopic parameter a 
(atomic polarizability) and the macroscopic parameter Xe (electric susceptibility). 

In this derivation we approximate an atom as a small conducting sphere. Un¬ 
doubtedly this model is an oversimplification. After all, an atom consists of a 
finite set of electrons in quantum states, rather than a continuous distribution of 
free charge in a sphere. Nevertheless, this simple model yields a reasonable theory 
of susceptibility. 

The polarization in a dielectric is 


P(x) = np 


(6.33) 
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where n is the atomic density and p the mean atomic dipole moment of atoms in 
the neighborhood of x. We have previously defined the atomic polarizability a by 
the equation p = aE, but now we need to be more precise in stating what field 
E contributes to the polarization. The total field E in the neighborhood of x is the 
sum of the field due to the atom itself E se if plus the field produced by all other 
sources E 0 ther- The field that polarizes the atom is E 0 ther» so 

P = a (E(x) - Eseif) i (634) 

the atom does not polarize itself, so we discount E se if. 

In a dilute gas the dielectric effect is small, so E 0 ther and E(x) are both approx¬ 
imately equal to the externally applied field. Then P « na E, and Xe ~ not/e o; 
note that Xe 1 for a dilute gas. But in condensed matter E ot her and E(x) differ 
significantly from the applied field, and from each other, so the calculation of Xe 
is more intricate. 

Treating the atom as a conducting sphere we can use results from Chapter 4 to 
estimate E se if- If a conducting sphere of radius a is placed in a uniform field Eo 
then it acquires a dipole moment p = 47ra 3 f qEo; the external field produced by 
the sphere itself is a pure dipole field. The total field, inside the conducting sphere 
is, of course, 0, because a conductor is an equipotential. Therefore the self-field 
inside the sphere is — Eo, which cancels the uniform applied field. So, in this 
model of the atom 


Eseif = “Eo = 


~P 

4?ra 3 eo 


(6.35) 


The field produced by a conducting sphere in an applied field has the same form 
as the field of a uniformly polarized sphere (which we found in Example 1) so 
another way to characterize this model is that it approximates the atom by a po¬ 
larized sphere. 

To define the radius a of the model atomic sphere, we assume the atoms are 
densely packed, and set equal to the mean atomic volume in the material, 
v = 1 /n. Then by the results of the previous paragraph, the self-field is 


np P 

Eseif = — Z = — Z • 

3eo 3eo 

Inserting this result into (6.34) the polarization is 
P = np = n« (e + 


(6.36) 


(6.37) 


Thus there is indeed a linear relation between P and E, and the susceptibility, 
defined in Table 6.2, is 


Xe = 


na/e p 

1 - na/( 3e 0 )' 


(6.38) 
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Or, we may solve for the atomic polarizability 

_ <?o Xe _ 3e 0 k - 1 
01 n 1 + Xg/3 n k + 2' 


(6.39) 


where in the second equality k is the dielectric constant, k = 1 + Xe- 

The relation (6.39) is called the Clausius-Mossotti formula. It was first ob¬ 
tained by Mossotti in 1850, by treating the atom as a conducting sphere. The 
derivation was refined by Clausius in 1879. Remarkably, this formula for the 
atomic parameter a was known before the structure of the atom was known. Of 
course a precise theory of the electric susceptibility must use quantum mechanics 
and modem condensed matter physics. But (6.39) does provide a reasonably accu¬ 
rate theory of the connection between microscopic and macroscopic parameters. 
For example, from the measured dielectric constant of water we may estimate the 
dipole moment of a water molecule. 2 


6.3.3 ■ Poisson's Equation in a Uniform Linear Dielectric 

In a linear dielectric, D = eE = —eVV. Now suppose the material is uniform, 
i.e., e is constant. In a region with no free charge, V ■ D is 0 so V(x) satisfies 
Laplace’s equation. In a region with free charge density pf(x), V (x) satisfies Pois¬ 
son’s equation 

-V 2 y = /Of/e. (6.40) 

The source is written as pf/e, rather than p/eo; these are in fact equal, but pf is 
the quantity that can be specified in an experiment. For a given distribution of 
free charge, along with appropriate boundary conditions, (6.40) determines F(x). 
From V (x) any other electrostatic quantity can be calculated. So, in a uniform lin¬ 
ear dielectric, electrostatics again reduces to a boundary value problem for V(x). 
We should emphasize that (6.40) is valid if the dielectric is uniform. 


EXAMPLE 2 Suppose two free charges are embedded in a uniform dielectric. 
What is the force on one of the charges? 

Let the charges be q\ located at xj, and c /2 at X 2 , as shown in Fig. 6.5. The free 
charge consists of q\ and q 2 - The force on q\ is F/ = q\E 2 (\\), where E 2 (x) is 
the electric field due both to q 2 and to the polarization produced by qj. The full 
electric field is — VV, where V(x), satisfying Poisson’s equation with the source 
Pf/e, is 


V (x) = 




+ 


<12 


47re|x —xi| 47re|x —X 2 


(6.41) 


The potential differs from the potential in vacuum by the constant factor eo/e = 
1 /k. The second term in (6.41) is the contribution attributable to the charge q 2 


z See Exercise l 
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FIGURE 6.5 Charges q\ and q 2 in a dielectric material. 


including the polarization caused by q 2 , and —V of that term is E 2 . Thus 


q x q 2 (xi x 2 ) _ Fi vacuum 
4^e|xi—X 2 I 3 k 


(6-42) 


F 1 vacuum means the force on q\ if no dielectric were present. Because k > 1, 
the force on q\ is less than it would be in vacuum. The charge q 2 is partially 
“screened” by bound charge of the opposite sign, which reduces its field strength 
relative to vacuum. Therefore F\ < F\ vacuum- 


In general the density of bound charge in a uniform dielectric is 

-l)pt(x). (6.43) 

The bound charge has the same x-dependence as the free charge, but opposite 
sign. For example, the bound charge associated with polarization by a point charge 
is itself a point charge of the opposite sign. 3 Equation (6.43) means that inside a 
uniform dielectric there can be bound charge only where there is free charge. 
Also, the total charge density (pb + Pf) is P(x) = pf(x)//r. 



6.4 ■ DIELECTRIC MATERIAL IN A CAPACITOR 

Historically, Faraday conducted the first systematic study of dielectrics. He found 
that if a dielectric material is placed in a capacitor then the capacitance increases. 
The ratio of the capacitance C with the dielectric to the capacitance C va cuum with¬ 
out the dielectric was the historical definition of the dielectric constant, and we 

3 See Exercise 9. 
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FIGURE 6.6 A parallel plate capacitor filled with a slab of dielectric material. The 
atomic dipoles produce positive bound charge on the upper surface of the slab, and negative 
on the lower. Let the z axis be perpendicular to the plates. 


will show that the ratio is just k, the parameter defined in Table 6.2. The relation 
C / Cvacuum = k provides a method for measuring k. 

Consider a parallel plate capacitor, as shown in Fig. 6.6, with plate area A 
and separation d, filled with an insulating material with dielectric constant k. The 
potential difference across the plates is V; the plates have charge ±Q, the free 
charge in this system. By Gauss’s Law, the displacement field between the plates 


D = ofk = 



(6.44) 


To prove this result, apply (6.30) to a pill-box shaped Gaussian surface cutting 
through one of the plates. The electric field E between the plates is D/e, so the 
potential difference is 


V = [ E-d£ = 

J- eA 

Thus the capacitance is 

r _Q_A 

V d ‘ 


(6.45) 


(6.46) 


In terms of the capacitance without the dielectric (C va cuum = eo A/d) the capaci¬ 
tance with the dielectric is 


C — K Cvacuum (6.47) 

where k = e/e<). The presence of the dielectric increases the capacitance by the 
factor k . We have analyzed a parallel plate capacitor, but the same result is true 
for any generalized capacitor embedded in a dielectric. 4 5 

4 The z axis is normal to the plates. 

5 See Exercise 12. 
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6.4.1 ■ Design of Capacitors 

In the most general terms a capacitor consists of two conductors in close proxim¬ 
ity. The capacitance is C = Q/V, where V is the potential difference when their 
charges are +Q and —Q. But in practical terms a “capacitor” means a circuit 
element that stores charge and energy. A pair of metal plates or foil sheets sep¬ 
arated by an insulator—air or a solid dielectric—is a common type of capacitor 
in circuits. The sandwich of two conductors and an insulator filling may be rolled 
or folded into a compact space. A coaxial cable is another kind of capacitor, in 
which charged regions travel along the cable as a wave. 

There are several reasons for putting a dielectric material between the conduc¬ 
tors of a capacitor. The charge that flows to a capacitor is supposed to pile up on 
the conductors, but not move through it. The solid dielectric holds the conductors 
apart, preventing charge from passing from one to the other. Also, the presence 
of the dielectric increases the capacitance, which is desirable if large capacitance 
is needed in a small volume. Finally, if the material has a large dielectric strength 
then breakdown will not occur at high voltage. 

For a parallel plate capacitor, we have C = keq A/d. For typical circuit capac¬ 
itors C is of the order of pF up to jrF. These values can be achieved by making 
A large and d small, e.g., by folding or winding large thin sheets of metal foil 
separated by thin dielectric layers such as mica or paper. No special technique is 
needed to make capacitors in this range of C. But for large capacitance, of order 
1F, or even mF, clever designs are required. Two stategies are commonly used to 
make C large. 6 

In a multilayer ceramic capacitor, metal strips are folded into many layers 
and separated by a ceramic insulator such as barium titanate, which may have a 
dielectric constant as large as 2 x 10 4 . The combination of multiple layers and 
large k can achieve C of order mF. 

In an electrolytic capacitor, the dielectric is a very thin layer of metal oxide. 
One conductor is a sheet of metal or metal foil on which the metal oxide is de¬ 
posited. The other conductor is a conducting acid paste or liquid making good 
contact with the oxide layer. Aluminum, aluminum oxide, and hydrochloric acid 
are the classic materials in an electrolytic capactor. The thickness d of the oxide 
layer may be as small as 10 -8 m. Furthermore, the metal surface may be etched 
into sharp peaks and valleys before depositing the oxide, to increase the surface 
area A. The combination of small d and large A can achieve C of order 1F or even 
larger. However, there are some limitations. Electrolytic capacitors are limited to 
low voltages, typically a few volts, because the breakdown voltage for such a thin 
insulating layer is small. Also, an electrolytic capacitor has a definite directional 
polarity: The metal foil must be positive with respect to the acid solution, because 
otherwise electrolytic action will destroy the oxide layer. 

In some applications the capacitance must be kept small. For example, large 
C in a coaxial cable implies a long time constant for changes of current, and is 
therefore not suitable for fast switching. 

®D. M. Trotter, Jr., Capacitors, Sci. Am., July 1988. 
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Small Capacitance in Computer Design. An actively pursued research problem 
with important commercial implications is to find insulating materials with very 
low k, which can be used to separate the small wires and transistors in chips for 
the next generation of computers. Currently computers use fused quartz (SiC> 2 ) 
for this purpose, but as the elements on computer chips get squeezed closer to¬ 
gether, the capacitance between neighboring conductors increases. Also, as the 
conductors are made thinner their resistance increases. The characteristic time of 
RC circuits is 7 r = RC, so to have circuits with fast switching times, and to keep 
signal delays and crosstalk small, it is important for intra- and interlayer capaci¬ 
tive coupling to be small. The ideal is to have insulators with k « 1, like air, so 
one approach is to use porous materials. As usual with engineering applications, 
there are a myriad of associated problems; e.g., the materials must be thermally 
stable, not chemically active with the Cu wires, and convenient to deposit. 

All the capacitors described above would have fixed C. Variable capacitors, 
such as are used in radio tuners, have interleaved plates, with one set of plates 
fixed and the other free to rotate, so that the overlap area of the plates can be 
varied. The capacitance increases as the overlap area increases. 

Historically, the first capacitor was the Leyden jar, invented in 1745 by 
van Musschenbroek in the Netherlands. The original Leyden jar consisted of 
a glass container partly filled with water, and a metal wire or chain with one 
end immersed in the water and the other end exposed. The Leyden jar would be 
charged by bringing the metal chain in contact with a machine that generated 
static electricity by friction. An improved version had separate metal foils coating 
the inner and outer surfaces of the glass container, making a capacitor capable of 
holding a large charge. 

During the eighteenth century, demonstrations of static electricity with Leyden 
jars were a popular entertainment in Europe. Discharging a charged Leyden jar 
through human volunteers could produce startling, even fatal, effects! In 1752 
Benjamin Franklin showed that lightning is an electric discharge in his famous 
kite experiment. He collected charge from a cloud by charging a Leyden jar from 
a metal key attached to the wet kite string. The Leyden jar charged by the kite 
behaved exactly the same as if it had been charged by an electrostatic generator. 

6.4.2 ■ Microscopic Theory 

Why, physically, does the dielectric in a capacitor increase the capacitance? To 
answer this question, let’s consider again the parallel plates shown in Fig. 6.6. 
For a given potential V, a larger charge Q resides on the plates if the dielectric 
is present, i.e., if C is larger. As we have learned, there is bound charge of oppo¬ 
site sign on the dielectric surfaces, which partially screens the free charge on the 
plates. 

Figure 6.6 illustrates the atomic dipoles aligned with the electric field in the 
dielectric slab. At the upper surface of the slab there is a positive bound charge 

7 See Section 7.5. 
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density <7b, u due to the positive ends of dipoles at the surface; at the lower surface 
the charge density is <7b,i = The net bound charge is 0. The surface density 

is <7b, u = P = by (6.18). Thus the electric field E z between the plates is 

a superposition of two terms: Q/{Ae o) due to the free charge, which is the same 
as for vacuum between the plates; and — ob, u / e o due to the bound charge, which 
equals —XeE z . That is, 



Solving for Q we find that the free charge for a given field strength E z = V/d is 

Q = Ae 0 (\+Xe)E z = ^. (6.48) 

d 

Q is larger for a dielectric-filled capacitor than for a vacuum-filled capacitor by 
the factor k = e/eo- 


6.4.3 ■ Energy in a Capacitor 

In Chapter 3 we showed that the field energy in an electrostatic system is 
(eo/2) f E 2 d i x. But this is not the total energy stored in a charged capacitor 
with a dielectric, because there is also some strain energy associated with the 
distortion of the polarized atoms. We can derive the total energy t/ cap of the 
capacitor, by calculating the work done to separate the free charge ±Q. If charge 
dQ is moved from the negative plate to the positive, then the work done is 
dW = VdQ, where V = Q/C. The total work that must be done to transfer 
charge Q, which is equal to U cap , is 


U, 


cap 


-L 


Q Q Q 2 
dQ = * 


C 


2 C 


(6.49) 


But compare f/ ca p to the field energy t/fkid • The electric field between the plates 
is Q/(eA), and the volume is Ad, so 


t/field = y 




(6.50) 


where we used (6.46) in the second step. The field energy is smaller than the total 
energy by the factor \/k. The additional energy in [/ cap is equal to the work that 
must be supplied to pull the atoms into induced dipoles, against the internal forces 
of the atom. 

The result (6.49) can be generalized. We shall show that the total energy of a 
linear dielectric is 



J D E d 3 x. 


(6.51) 
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This agrees with (6.49) because it implies the result U = t/ C ap = kU&m that we 
found before. 

To justify (6.51) in general, consider the work done to change the free charge 
density by a small amount Spf. The energy added is SU = f SpfVd 3 x. Now, 
Spf = V • (<5D) by Gauss’s Law, so 

SU = J V • (<$D) VdP'x = j SD • (—VV) d 3 x. (6.52) 

To obtain the second equality in (6.52) we have integrated by parts and discarded 
the surface term at infinity. But — VV is E, and for a linear dielectric 

<5D • E = e<5E • E = € -S(E 2 ) = ^<5(D • E). (6.53) 

Thus SU is equal to S ^ / D • Ed 3 xj for an arbitrary change, proving (6.51). We 

may interpret • E as the total energy density in a linear dielectric, including 
both field energy and strain energy of the atoms. 8 


EXAMPLE 3 A parallel plate capacitor has plate area A and separation d. Denote 
by U the total energy if the space between the plates is filled with a dielectric with 
permittivity e, and by Uq the energy if the space is vacuum. 


Case 1. Calculate U/ Uq if the charge on the plates is specified to be ±Q. 

The displacement field is D = <jfA, where Of = Q/A and A is normal to the 
plates. (Prove this from Gauss’s Law (6.30).) It is important to understand that 
in this case D is the same for dielectric or vacuum, because the free charge is 
specified to be the same. The energy U for the dielectric-filled capacitor is 



D l 

■Ed 3 x = —Ad 
2e 


Q 2 d 
2eA ’ 


which is just Q 2 /2C. The energy for the vacuum-filled capacitor is 


The ratio of the energies is 


£/o = 


Q 2 d 
2 e 0 A' 


U_ _ eo = 
f/o e k ' 


(6.54) 


(6.55) 


(6.56) 


For specified charge, the dielectric-filled capacitor has less energy than the 
vacuum-filled capacitor. 

8 Note that (6.51) is not true for nonlinear dielectrics. 
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Case 2. Calculate U/ Uo if the potential difference between the plates is speci¬ 
fied to be V. 

The electric field is E = n V/d because f E ■ dl = V. It is important to 
understand that in this case E is the same for dielectric or vacuum, because V is 
specified to be the same. The energy U for the dielectric-filled capacitor is 



£ j^2 

D • Ed 2 x = - Ad 

2 


V 2 eA 
2d ’ 


(6.57) 


which is just \CV 2 . The energy for the vacuum-filled capacitor is 


Uo = 


V 2 eoA 

2d 


Thus the ratio of the energies is 


U _ e_ 
Uq 60 


(6.58) 


(6.59) 


For specified potential difference, the dielectric-filled capacitor has more energy 
than the vacuum-filled capacitor. 


6.4.4 ■ A Concrete Model of a Dielectric 


By formal considerations we found that the energy density in a linear dielectric is 
|D • E, where D = eE is the displacement field. This energy includes both field 
energy and strain energy of the polarized atoms. Can we understand this result 
from more physical considerations? A complete theory of a polarized atom would 
require quantum mechanics, but we can gain some insight from a classical model. 
Treat the atomic dipole as a pair of charges +q and —q bound together by a 
spring force, with potential energy A Kx 2 where K is the spring constant and x is 
the separation of the charges in the direction of the applied electric field. We now 
analyze a dilute gas of these model atoms, to leam something about a dielectric 
in a field E = £o> • 

The model atom is in static equilibrium, with charge separation x eq , when 
the electric force qEo balances the spring force Kx eq on q (or —q)\ that is, 
x eq = qEo/K. Then the dipole moment is qx eq = q 2 Eo/K , and therefore the 
polarization field is P x = np x = nq 2 Eo/K, where n is the atomic density. The 
displacement field in the material is 


D x = e 0 E x + P x = eo 



E o. 


(6.60) 


Now, what is the energy density, ul According to our formal result (6.51), 


1 


eo 


u — —D • E = — ( 1 H~ 


nq 

eoK 


E 2 . 


(6.61) 



208 


Chapter 6 Electrostatics and Dielectrics 


The first term on the right-hand side of (6.61) is the energy density of the electric 
field, t<field = eoi’o/2. The second term is the potential energy per unit volume 
stored in the atomic springs, because 


1 nq 2 

YY 


E 0 = ^ Kx tq n = « 


* spring- 


(6.62) 


So this classical model shows that 


jD • E = «field + M spring. 

and verifies the interpretation of • E as the total energy density in the system 
of atoms and field. 


6.5 ■ BOUNDARY VALUE PROBLEMS WITH DIELECTRICS 

Something important that we need to leam about dielectrics, and remember for 
later applications, are the boundary conditions for E and D at dielectric surfaces. 
In this section we derive them first, and then use them in some interesting exam¬ 
ples. 

6.5.1 ■ The Boundary Conditions 

Because V x E = 0, the tangential components of E are continuous across any 
surface in space. To prove this statement, consider Figure 6.7(a) which shows 
a surface S cut by a small loop T. The loop integral E • di is 0 for any T, 
by Stokes’s theorem. Now let T shrink, until its width across S approaches 0; 
the opposite sides of T then coincide with a curve C on S. The loop integral 



FIGURE 6.7 Derivation of the boundary conditions at a surface, (a) The tangential 
component of E is continuous, by applying Stokes’s theorem to f. (b) The normal com¬ 
ponent of D jumps by Uf, by applying Gauss’s theorem to £. These conditions might be 
applied, for example, at the interface between two dielectrics. 
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approaches f c (E 2 ~ EO-ids, where E 2 andEi are the field vectors just above and 
below S, respectively, and t is the tangent to C. The fact that this integral is 0 for 
any curve C implies that the tangential components of E are continuous across S, 
as claimed. This is the same argument as in Sec. 3.3.2 in slightly different terms. 

Gauss’s Law, V • D = pf, implies that the normal component of D is continuous 
across S, unless there is a free sutface charge, in which case L> 2 n — D\ n = rtf. 
To prove this statement, consider Fig. 6.7(b) which shows a Gaussian pillbox E 
cutting through S. In the limit that the height of the pillbox approaches 0, the 
flux of D through E approaches /(T> 2 n — D\ n )dA , where dA is the area of an 
infinitesimal patch on S. By Gauss’s theorem the flux is f OfdA. The fact that 
these surface integrals are equal for any area on S implies that D 2n — D\ n = of, 
as claimed. At an interface between two dielectrics, the normal component of E is 
usually discontinuous because of bound charge on the surface. If there is no free 
charge then D n is continuous, and £20 — E\n = D n (e 2 1 — e j” 1 ). 

The methods of boundary value problems may be used to calculate the fields 
in an electrostatic system with dielectrics, by satisfying the above boundary con¬ 
ditions. The next three subsections illustrate the methods. 

6.5.2 ■ A Dielectric Sphere in an Applied Field 

Figure 6.8 shows a dielectric sphere of radius a in an applied field. If the sphere 
were not present then the electric field would be £ok- The sphere distorts the field 
nearby, but asymptotically E approaches fink. 

Because V x E = 0, we may write E in terms of a potential, 9 as E = — VV. 
There is no free charge so V • D = 0. Assuming the interior is a uniform lin- 



FIGURE 6.8 A dielectric sphere in an applied field. 


9 In general we cannot write D as a gradient. However, for the special case of a uniform linear dielectric, 
D = -fW = -V(eV). 
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ear dielectric (D = eE), and the exterior is vacuum (D = eoE), V • E is 0 in 
both regions, and V 2 V = 0. So, the problem is to solve Laplace’s equation with 
appropriate boundary conditions. 

Applying methods from Chapters 4 and 5, we shall try interior and exterior 
solutions of the form 


Vim(r, 0) = —CircosO for r < a, (6.63) 

C2fl 3 

Vext(r, 6) = —EorcosO H-x—cos0 for r > a. (6.64) 

r l 

These satisfy Laplace’s equation in both regions. Vjnt is finite at r = 0; and V e xt 
has the correct asymptotic limit as r —*■ oc, such that E —*■ £<)k. The constants 
Ci and C 2 must be chosen to satisfy the boundary conditions at r = a. 

For Ei to be continuous at the surface r = a, V must be continuous; therefore 
Ci = Eo — C 2 . The condition for D„ to be continuous is 


—e 


aVin 


dr 


= -e 0 

a 


a Vext 
a r 


a 


therefore kC\ = £0 + 2 C 2 . The solution to these simultaneous equations is 

Ci = -^rfo and C 2 = ^—^Eq. (6-65) 

Some electric field lines are shown in Fig. 6.8. The electric field inside the 
sphere is Cik, uniform in the k direction but with strength reduced by the factor 
3/ (k + 2) compared to the applied field. The field outside is the superposition of 
the applied field and a pure dipole field. Inspecting (6.64) we see that the dipole 
moment is p = 4^eoC2a 3 k- Because C 2 a £ 0 , the dipole moment is propor¬ 
tional to the applied field. The polarizabilty of the dielectric sphere is the constant 
of proportionality a in the linear relation p = aEo. The previous result (6.65) 
implies 


a = 4^eoa 3 ^—+ 2 ) ' (6.66) 

For comparison, recall that the polarizability of a conducting sphere is 4^eoa 3 . If 
k = 1 there is no dielectric and the polarizability is 0. As k increases a increases, 
and when k is large a approaches the value for a conductor. 

Finally, it is useful to stand back and compare the results just obtained for a 
dielectric sphere in an applied field £ok, to the results of Example 1 for a sphere 
with uniform polarization Pok- The external field for the polarized sphere is a pure 
dipole field, while that for the dielectric sphere is a dipole field superimposed 
on the uniform applied field. These results are perhaps expected. The internal 
fields are more surprising. Although in both cases the internal field is uniform, 
the directions are different: Ej m is in the —k direction for the polarized sphere, 
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but in the +k direction for the dielectric sphere. The two problems are closely 
related, because there is uniform polarization of the sphere in both cases. 


6.5.3 ■ A Point Charge above a Dielectric with a Planar Boundary Surface 

Figure 6.9 shows a charge q at distance d from a dielectric. The xy plane is the 
surface of the dielectric. Again, the potential V (x) must satisfy Laplace’s equation 
in both regions z > 0 (except at the position of q) and z < 0. We can construct 
V(x) by the method of images. However, the image charges for a dielectric are 
more complicated than the image charge for a conductor. 

There are different image charges for the regions z > 0 and z < 0. In each 
case the image charge lies in the other region, so that Laplace’s equation holds for 
the contribution to V from the image charge. In the region z > 0, i.e., outside the 
dielectric, the potential is 


TaboveCr, V, Z) — 

4tT6() 


-y/x 2 + y 2 + (z - d) 2 yjx 2 + y 2 + (z T 


dY 


(6.67) 


q is the real charge at (0, 0, d) and q' is an image charge at (0, 0, — d). In the 
region z < 0, i.e., inside the dielectric, the potential is 


Fbelow (x, y, Z ) 


1 q" 

4^0 y/x 2 + y 2 + (z - d) 2 


( 6 . 68 ) 


q" is an effective charge at (0, 0, d) in the region above the dielectric. Note that 
the field inside the dielectric (z < 0) is the same as for a point charge at the 
position of q, but with a different strength q". 

The image charges q' and q" must be chosen so that the boundary conditions 
are satisfied. The condition that V is continuous at z = 0 implies q + q' = q". 



FIGURE 6.9 A point charge above a planar dielectric surface. 
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A brief calculation 10 shows that the condition for D n to be continuous at z = 0 
implies q — q' = Kq". The solution to these simultaneous equations is 

q' = -^—and q”=-^- 7 q- (6-69) 

If k = 1 then there is no dielectric, and the image charges are q' = 0 and 
q" = q; in other words, V (x) is just the Coulomb potential of q. If k 1 then 
the image charges are approximately q' = —q and q" = 0, the same as for a 
conducting plane. 

Of course q' and q" are fictitious. The real charge consists of the free charge q 
and the bound charge cth on the dielectric surface. In the region z > 0 the electric 
field due to ot, is the same as if there were a point charge q' at z = —d. By 
reflection symmetry, in the region z < 0 the electric field due to <7b is the same 
as if there were a point charge q' at z = +d; adding this image charge to the real 
charge q gives the effective charge q" = q + q' at z = d. 

6.5.4 ■ A Capacitor Partially Filled with Dielectric 

Figure 6.10 shows parallel plates of size l x w and separation d, partially filled 
with a dielectric slab. The dimensions of the slab ar ex xw xd, with x < £. What 
is the capacitance? Assume the separation d is small compared to x and £ — x, so 
that end effects are negligible. 

Let the potential difference between the plates be V. The key to this example 
is that the electric field E is the same in the two regions (dielectric and vacuum) 
between the plates. By planar symmetry, the field is uniform in both regions and 
normal to the plates. 11 But E, being normal to the plates, is tangent to the bound¬ 
ary surface of the dielectric. Since the tangential field must be continuous, the field 
is the same on either side of the boundary. Thus the electric field is E = nV/d in 
both regions, where n is normal to the plates. 

To determine the capacitance we must relate V to the charge ± Q on the plates. 
Apply Gauss’s Law to a surface surrounding one plate, e.g., the lower plate: The 
flux of D is equal to Q. That is, since D is eE in the dielectric (of area x w) and 



FIGURE 6.10 A dieleclric slab inserted part-way between charged plates. A force is 
pulling the slab into the space between the plates. 


10 See Exercise 18. 

^Approximating E as uniform neglects the end effects at the end of the dieleclric. 
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eoE in the vacuum region [of area (£ — x)w\, 

e Exw + — x)w = Q. (6.70) 

Substituting E = V/d we find that the capacitance is 

C= § = [ex + e 0 (£~x)]y. (6.71) 

V d 

We might have anticipated this result by noting that the system i s equivalent to 
two capacitors in parallel, with C\ = exw/d and C 2 = e^d — x)w/d. Then the 
combined capacitance is C\ + C 2 . 

There is a related example, in which the dielectric slab has the full area of the 
plates but a thickness less than the separation between the plates. In this case it is 
the displacement field D rather than E that is the same in the slab and the vacuum, 
because the normal component of D is continuous. 12 

The Force on the Dielectric Slab 

The energy formula (6.51) can be used to evaluate the force on a dielectric slab 
inserted part way between oppositely charged plates, as shown in Fig. 6.10. The 
plates have dimensions i x w, and are separated by d. A length x of the dielec¬ 
tric lies between the plates. The slab will experience a force pulling it into the 
capacitor. Qualitatively it is easy to see why the slab is pulled in. The atoms in the 
material are polarized by the electric field, and therefore attracted to the charged 
plates. But the problem is to calculate the force F. We’ll determine F in two 
different ways in the following examples. 


EXAMPLE 4 Plates with fixed charge. Suppose the plates are isolated so that 
their charges ± Q are constant. The work done by the electrostatic force if the slab 
moves a distance dx into the capacitor is F dx. Because the system is isolated, 
conservation of energy says that the work is equal to — d U where U is the total 
energy of the capacitor. Therefore the inward force on the slab is F = —dU/dx. 

To calculate U for this example it is natural to use the equation U = Q 2 /2C, 
because Q is constant. The capacitance is given by (6.71). 13 Thus the energy is 


U(x) = 


Q 2 d 

2w [ex + €q(£ — x)] 


(6.72) 


and the force on the slab is 


Q 2 d (e - e 0 ) 

2 w [ex + eo(^ — x)] 2 


(6.73) 


l2 See Exercise 21. 

13 The capacitance in (6.71) was derived under the assumption that end effects are negligible, which is 
valid if d <K x and d « i — x. 
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For fixed charge the force decreases as x increases. 14 We may also express F in 
terms of the potential difference between the plates, V = Q/C, as 


V 2 W (6 - £p) 
2d 


(6.74) 


EXAMPLE 5 Plates with fixed potential difference. Now suppose the plates in 
Fig. 6.10 are connected to a battery so that the potential difference V is fixed. In 
this case the work done by the electrostatic force F is not equal to —dU, because 
additional energy is supplied by the battery. If the slab moves by distance dx, then 
charge d Q is transferred to the plates from the battery, and so the battery supplies 
energy (d Q) V. The conservation of energy in this case is 

dU = (dQ)V - Fdx. (6.75) 

Here it is natural to use the equation U = C V 2 /2, because V is constant. Also, 
Q = CV so dQ = (dC)V. Inserting these relations into the conservation law 
gives 


Fdx = -\{dC)V 2 + ( dC)V 2 = \{dC)V 2 . (6.76) 


Thus the force on the slab is 


= F 2 w (e - go) 
2 dx 2d 


(6.77) 


This result is the same as (6.74) obtained earlier by considering the plates to be 
isolated. 

If V is held constant then the force is independent of x, according to (6.77). 
This is true if the fraction of the slab between the plates is not too close to 0 or 
1. An approximation has been made in deriving (6.77), that end effects can be 
neglected at the end of the dielectric. This is valid if x d and i — x d, so for 
that range of x the force for constant V is approximately independent of x. But 
if x = l, i.e., the slab is completely inserted, then it is obvious by symmetry that 
the force must be 0, not (6.77). 


FURTHER READING 

1. S. Gasiorowicz, Quantum Physics, 2nd ed. (Wiley, New York, 1996). Chapter 16 de¬ 
scribes the H atom in an electric field. 

2. H. C. Ohanian, Principles of Quantum Mechanics (Prentice Hall, Englewood Cliffs, 
NJ, 1990). Section 10.1 describes the H atom in an electric field. 

14 See Exercise 22. 
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EXERCISES 

Sec. 6.1. Atomic and Molecular Dipoles 

6.1. Consider two electric dipoles pj and P 2 located, respectively, at the origin and on 
the x axis at (a, 0, 0). There are no other charges or fields present. 

(a) Let P] = pi and P 2 = pk. If pi is held fixed, what is the direction of the 
torque on P 2 ? If P 2 is held fixed, what is the direction of the torque on pj? If 
both vectors are free to rotate, what are the orientations at static equilibrium? 

(b) Rank the following four configurations in order of the interaction energy, from 
smallest energy to largest: (i) pj = pk and P 2 = pk, (ii) pj = pk and p 2 = 
—pk, (iii) pj = p i and P 2 = pi, (iv) pj = p i and P 2 = —p i. (This exercise 
can be solved analytically, but you might find it easier to use a computer to 
calculate the energies, treating each dipole as 2 charges +q and —q separated 
by a small distance, say 0.01a.) 

6.2. Figure 6.11 shows a dipole p at distance zo from a grounded conducting plane, 
taken to be the xy plane. The direction of p is at an angle 6 to the normal of the 
plane. Find the torque on p. What are the equilibrium values of 61 [Answer: N = 
—p 2 sin(? cost? j /(32jreo^o)^ 

6.3. Suppose the magnitude of the permanent dipole moment of a polar molecule is 1 D 
(Debye). If the molecule is placed in an electric field £yk, then the thermal average 
( p z ) is given by the Langevin formula (6.8). 

(a) Plot {p z ) versus temperature T in Kelvin, for £o = 10 6 V/m. Be sure to label 
the axes. Notice that the interesting temperature range is at low T. 

(b) Plot the polarizability, defined by a = d(p z )/dEo, versus T. 

6.4. (a) A dipole p with fixed magnitude |p| = po is placed in an external electric field 

E(x) that varies with position. Show that there is a force on the dipole, and find 
how this force depends on the orientation of the dipole and the variation of the 
field. [Answer: F = (p • V) E = V (p • E)] 


z 

i 

i 



FIGURE 6.11 Exercise 2. An electric dipole interacting with a grounded conducting 
plane. 
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(b) Findthe force and torque on a dipole in the field of a point charge. Letthecharge 
q be at the origin, and the dipole 

P = PO (sin£ 0 i + cos| 0 k) 

be at the point (0, 0, zp). Also, find the force on q, and verify Newton’s third 
law. [Answer: The force on q is 

P ° q , (- sin £ 0 i + 2cos£ok) •] 

4jre 0 z5 v ' 

Sec. 6.2. Polarization and Bound Charge 

6.5. A dielectric object that has a quasi-permanent polarization when the applied field 
is 0 is called an electret. Consider a uniformly polarized electret in the shape of a 
cylinder of height h and radius 10 to. The polarization in the dielectric is Pk, where 
k is parallel to the cylinder axis. 

(a) Sketch the electric field lines. 

(b) Calculate the electric field E at the center of the cylinder. Because the radius is 
large compared to the height, you may neglect edge effects. 

(c) Calculate the electric field E on the midplane of the cylinder, at distance 1 OO/i 
from the center. Because the distance is large compared to the radius, the dipole 
dominates the multipole expansion. 

6.6. Many microphones manufactured today are based on the “foil electret” design. Ob¬ 
tain information on the foil electret from a library or the internet. What is a foil 
electret, and how is it used in a microphone? 


Sec. 6.3. The Displacement Field 


6.7. Two concentric conducting spherical shells, with radii a and la, have charge +Q 
and — Q respectively. The space between the shells is filled with a linear dielectric 
with permittivity 


e(r) = 


ep a 

1.5a -0.5r ’ 


which varies with radial distance r from cp at r = a to 2ep at r = la. 

(a) Use Gauss’s Law to determine the displacement field between the spherical 
shells. 

(b) Determine the bound charge density between the spherical shells. 

(c) Determine the total energy U of this system, from the relation U = j/ DErf 3 .t. 

(d) Determine the capacitance. [Answer: C = 4^eoa/(0.75 — 0.5 In 2)] 

6.8. On the Clausius-Mossotti formula 

(a) Sketch a plot of the dielectric constant k versus atomic density n. 

(b) According to Langevin’s formula, the polarizability of a polar molecule at tem¬ 
perature T, for pE <SC kT, is a = p^/(3kT), where p is the permanent dipole 
moment. From the measurement k = 80 for water, calculate the dipole moment 
of a water molecule. Express the result in units of eag. 
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6.9. A point charge Q is embedded in a dielectric medium with dielectric constant k. 

(a) What is the free charge enclosed by a sphere of radius R centered at Q1 

(b) What is the bound charge enclosed by the sphere? How does the result vary with 
R1 

(c) What is the total charge enclosed by the sphere? 

(d) Explain the result of (b) microscopically, assuming an atom to be a tiny pair of 
equal but opposite charges ±e with dipole moment p = aE. (Hint: For small 
atomic density n , approximate Xe = na/e q.) 

6.10. The dielectric constant of air is 1.00059. From this determine the mean polarizability 
of atmospheric molecules. Compare the result to the atomic polarizabilities in Table 
6 . 1 . 


Sec. 6.4. Dielectric in a Capacitor 

6.11. Consider a parallel plate capacitor, with plate area A, plate separation d, and dielec¬ 
tric constant k of the material between the plates. 

(a) Evaluate the capacitance if A — 100cm 2 , d — 1 mm, and k — 120. 

(b) If the potential difference between the plates is 6 V, calculate the energy of the 
capacitor. [Answer: U = 1.9 x 10“ 7 J] 

6.12. Two conductors of arbitrary geometry are embedded in a uniform linear dielectric 
with dielectric constant k. Show that the capacitance is equal to k times the capaci¬ 
tance for the same conductors in vacuum. 

6.13. A multi-plate capacitor consists of 6 parallel plates, as shown in Fig. 6.12. Alter¬ 
nating plates are connected to opposite terminals. Calculate the capacitance if the 
plates have area A, separation d, and are separated by a material with permittivity e. 
[Answer: 5 eA/d] 

6.14. Two isolated square parallel conducting plates, of side L and separation d, are 
charged with surface densities +<r on the upper plate and —a on the lower. Two 
dielectric slabs, each with thickness d/ 2 and area L x L, are inserted between the 
plates, one slab above the other as shown in Fig. 6.13(a). The dielectric constants are 
k\ and K2- Assume rf«L. Determine: 

(a) D everywhere between the plates. 

(b) E everywhere between the plates. 


+ 


FIGURE 6.12 Exercise 13. A multi-plate capacitor with six plates. 
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HinH It s ■* : 

K 2 


(a) 


+Q 


k 2 


-Q 

(b) 

FIGURE 6.13 (a) Exercise 14. A cylindrical Gaussian surface is indicated by the dashed 
outline, (b) Exercise 15. 


(c) The bound surface charge densities ay, on the three dielectric surfaces. 

(d) The capacitance. [Answer: C = (€qL?/ d){2ic\K2/(K\ +<q)}] 

Finally, verify Gauss’s Law in the form f D • dA = Q f and in the form f E • dX — 

Q/e 0 • 

6.15. Suppose the volume between the two plates in the previous exercise is filled with two 
dielectric slabs, each with thickness d and area L x L/2, inserted between the plates 
side by side as shown in Fig. 6.13(b). The dielectric constants are k\ and /q. Charge 
+ Q is placed on the upper plate and charge —Q on the lower plate. Determine: 

(a) D everywhere between the plates. (Hint: a is not constant on the plates.) 

(b) E everywhere between the plates. 

(c) The bound surface charge densities ay, at the four areas where the dielectric slabs 
touch the plates. 

(d) The capacitance. [Answer: C = (e$L?-/d){(K\ + K2)/2}] 

(e) Compare the capacitance for this configuration to that of the configuration in the 
previous exercise, for the special case kj = 1. (Note that the capacitances of the 
two configurations are the same if k\ = K 2 -) 

6.16. Consider a spherical capacitor consisting of concentric conducting spheres of radii 
a and b. If the region between the spheres is vacuum, then the capacitance is 
4neoab/(b — a). 

(a) What is the capacitance if the volume between the conductors is partly filled 
with dielectric as shown in Fig. 6.14(a)? (Hint: D n is continuous at the dielectric 
boundary.) [Answer: C = 4jtKeQab(a + b)/((Ka + b)(b — a))] 

(b) What is the capacitance if the volume between the conductors is half-filled with 
dielectric as shown in Fig. 6.14(b)? (Hint: E ( is continuous at the dielectric 
boundary.) [Answer: C = 2jreo(K + 1 )ab/(b — a)] 
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(a) (b) 


FIGURE 6.14 Exercise 16. The concentric conducting spherical shells have radii a and 
b. A dielectric material occupies part of the volume between the spheres. In (a) the dielec¬ 
tric is a spherical shell with inner radius a and outer radius (a + b)/2. In (b) the dielectric 
is a hemispherical shell of thickness b — a. 


Sec. 6.5. Boundary Value Problems with Dielectrics 

6.17. Consider an electric field line passing through a planar interface between two insu¬ 
lating media with dielectric constants k\ and K2■ Assume there is no free charge on 
the interface. Let 9\ and be the angles between the field line and the normal to the 
interface in the two regions. Prove that /q cold] — /q cot 62 - 

6.18. For a point charge q above a planar dielectric surface, shown in Fig. 6.9, show that 
the boundary conditions on E; and D„ imply q + q' = q" and q — q' = icq". Here 
q' and q" are the image charges; see Eqs. (6.67) and (6.68). 

6.19. A point charge q is located in vacuum at distance z from a dielectric with a planar 
boundary surface. 

(a) Determine the force on the charge. 

(b) Determine the surface charge density on the dielectric boundary. [Answer: er = 
— (2dq/4nr^){(K — 1)/(k + 1)}] 

6.20. (a) Sketch a plot of the electric field lines fora dielectric sphere in a uniform applied 

field. Show the directions, and explain the directions from the distribution of 
bound charge. 

(b) Sketch a plot of the electric field lines for an electrostatic system consisting of a 
point charge above a planar dielectric surface. 

6.21. A slab of dielectric with area A and thickness t is placed in a parallel plate capacitor 
having plate area A and separation d , with d > t. Show that the capacitance is 
e 0 eA/[e(d -t) + e o']- 

6.22. Plot a graph of the force pulling a dielectric slab into the region between parallel 
plates with fixed charges ±Q (the system shown in Fig. 6.10) as a function of the 
length x of slab between the plates. Let Q = 1 /aC, e — 5ep, d — 1 cm, w — 5 cm, 
and l = 20 cm. Note that Eq. (6.73) was derived assuming d«r and d «(-r. 
What is the force on the slab if x = tl Why is (6.73) not correct in the latter case? 
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6.23. A hollow dielectric sphere, with dielectric constant k, inner radius a and outer radius 
b, is placed in a uniform applied field £ok. The presence of the sphere changes the 
field. Find the field inside and outside the sphere, and in the dielectric material. What 
is the field at the center of the sphere? What is the dipole moment of the sphere? 
[Answer: 


= 47Tf 0 b 3 (a 3 - b 3 )(2K + l)(y - l)E n 
Pz 2a 3 (/c-l) 2 -fc 3 (2/c + l)(/c + 2) 


General Exercises 

6.24. A high voltage coaxial cable has an inner wire with radius 0.3 cm and an outer 
sheath with radius 0.8 cm. The insulating material is a plastic with dielectric strength 
40 MV/m and dielectric constant k = 3. What is the maximum voltage difference 
V'max that can be supported without dielectric breakdown? Does V max depend on k! 

6.25. (a) Two parallel conducting plates in air are connected to the terminals of a 12 V 

battery. The capacitance in air is 1.0 //.F. Then the battery is disconnected and a 
plexiglas sheet that just fits the gap is inserted between the plates. What is the 
final potential difference, in V? What is the stored energy, in J? 

(b) Again the two plates in air are connected to the 12 V battery. While the battery 
is still connected, the plexiglas sheet is inserted between the plates. What is the 
stored energy, in J? 

6.26. The space between two parallel conducting disks, which have radii R and separation 
d R, is filled with a nonuniform dielectric whose permittivity varies linearly as a 
function of the distance from the center, 

e(r) = <q + (e 2 - €\)r/R. 

Calculate the capacitance. [Answer: (;r/? 2 /rf)(ei/3 + 2^2/3)] 

6.27. Two identical parallel-plate capacitors, each with capacitance C, are connected in 
series across a fixed total potential V. Then a slab of dielectric, with dielectric con¬ 
stant k, is inserted into one of the capacitors. Calculate the change of total energy 
stored in the two capacitors, the work done by the electrostatic force on the slab as 
it is inserted, and the energy supplied by the voltage source. 

6.28. Consider two insulating media with dielectric constants <q and k 2 , placed together 
with a planar interface between them. In the region of iq there is a line charge with 
charge per unit length k, parallel to the interface at perpendicular distance a. Use 
the method of images to find the electric field in both media. Show that the force per 
unit length on the line charge is 

dF A 2 (iq — k 2 ) 
dL 47T6]a(xr] + k 2 )' 

Explain physically why the force is away from the interface for iq > k 2 and toward 
the interface for *q < k 2 . 

6.29. A spherical capacitor consists of an inner metal sphere at radius a , an insulating shell 
with dielectric constant k\ for a < r < b, another insulating shell with dielectric 
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constant K 2 for b < r < c, and an outer metal sphere at radius c. Determine the 
capacitance. 

6.30. Two coaxial thin-walled conducting tubes with radii a and b are dipped vertically 
into a dielectric liquid of suspectibility Xe and mass density p. If a voltage difference 
Vo is applied to the tubes, the liquid rises to a height h in the space between the tube 
walls. 

(a) What is h in terms of the other parameters of the problem? [Answer: h = 
( e 0XeV$)/(pg(b 2 - a 2 ) ln(b/a)).] 

(b) Note that this is generally a small effect. For example, calculate h if the dielectric 
is water, fora = 1.0 cm, b = 1.2 cm, and Vq = 500 volts. [Answer: 2.2 mm] 

6.31. A very long dielectric cylinder of radius a and dielectric constant k is placed in a 
field Eo perpendicular to its axis. 

(a) What is the electric field inside the dielectric cylinder? 

(b) What is the induced dipole moment per unit length? 

(Hint: Take the cylinder axis to be the z axis, and Eq = Eo' • The potential for a 
line dipole, with dipole moment per unit length p' = Ci, is Ccos<p/(2neQr) in 
cylindrical coordinates.) 



CHAPTER 



Electric Currents 


So far we have only considered electrostatics, the physics of charges at rest. In this 
chapter we begin the study of electrodynamics, the physics of charges in motion. 
An electric current is a net flow of charge, due to motion of charged particles, 
as illustrated in Fig. 7.1. The mathematical description of current is a necessary 
preparation for the theory of magnetism, because electric current is a basic source 
of the magnetic field. 

The most familiar electric currents are those in metal wires, such as incandes¬ 
cent bulb filaments, power cords of appliances, and windings of electric motors. 
But there can also be currents in regions where there are no metal conductors. 
There are currents in fluorescent lamps and neon signs, in which charges move 
through a gas at low pressure in a tube. Lightning is flow of charge through the 
open air. There are electric currents in nerve cells, due to motion of positive and 
negative ions through plasma membranes. 

A simple, prototype current is steady-state, or direct current, i.e., one in which 
the mean velocity of the charges is constant. Ordinary household and industrial 
applications involve alternating currents, which oscillate at a frequency of 60 Hz. 
The essential physics of steady-state currents applies also to ac currents because 
the time scales of the microscopic processes that determine electrical conductivity 
are very short. For example, Ohm’s law, and the resistance of a wire, are essen¬ 
tially the same for direct current or 60 Hz alternating current. 

“Flow” is a continuum concept. Think of water flowing in a river. But charge 
resides in discrete atomic particles, which move individually in very irregular 
motions. Because current is the net flow of charge, averaged over many particles, 
it is a smooth function of x and t. 

Some results in this chapter, particularly the continuity equation, are true for 
currents with arbitrary time dependence, as we will see. The continuity equation 
is a fundamental result, relating current and charge densities, and expressing the 
conservation of charge. 


7.1 ■ ELECTRIC CURRENT IN A WIRE 
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The conceptually simplest example of an electric current is the current in a thin 
conducting wire. In an ideal one-dimensional wire the current I is defined as the 
net charge passing a point P per unit time. In a real wire / is defined as the charge 
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FIGURE 7.1 Current. Current may be produced by motion of either positive or negative 
charged particles, or both. 


per unit time passing through a cross section of the wire at P 



(7.1) 


where d Q denotes the net charge passing the position P during the infinitesimal 
time interval dt. For a current to the right (see Fig. 7.1), 


dQ=dQ+(R)-dQ-(L), 


where dQ+(R) is the charge of positive carriers moving to the right past P, and 
dQ-{ L) is the charge of negative carriers moving to the left, as illustrated in 
Fig. 7.1. The unit of current is the ampere (A), which is the basic electric unit in 
the SI system. The definition of the ampere is discussed in Chapter 8, in terms of 
the magnetic force on a current carrying wire. 1 A coulomb of charge is defined 
by 1 C = 1 A s. If the wire carries a current of 1 A, then 1 C of net charge passes 
P each second. 

If the current in a wire is due to charges q moving with mean velocity v, and 
the charges have linear density ni (= number of charge carriers per unit length), 
then 


I = qniv. (7.2) 

Note that the units of qniv are C/s = A. To appreciate the meaning of (7.2), 
consider the wire shown in Fig. 7.2. Assuming the mobile charges are positive, 
the amount of charge that will pass the cross section at P, during dt, is equal to the 

1 By definition, the force per unit length on very long parallel wires carrying 1 A and separated by 1 m 
is exactly 2 x 10 -7 N/m. 
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v dt 



P 


FIGURE 7.2 Current in a wire. For positive charge carriers, the charge in the length 
vdt passes a cross section at P during the time dt. Therefore I = qniv. 


charge of the mobile particles contained i n the length v dt upstream of P, because 
those are the particles that pass P during dt. That charge is dQ = qni(vdt); 
hence (7.2). 

In a metal wire the charge carriers are the conduction electrons, and each elec¬ 
tron has negative charge — e where e = 1.602 x 10“ 19 C. A current in the wire 
corresponds to electrons moving on average in the direction opposite to tHfe cur¬ 
rent. dQ in (7.1) is the net charge passing P: negative charge moving in the neg¬ 
ative direction is a positive current, as is positive charge moving in the positive 
direction. If there are both positive and negative charge carriers ±q with equal 
linear density ni and moving with velocities u± in the one-dimensional space, 
then / = qni{v+ — u_). 


7.2 ■ CURRENT DENSITY AND THE CONTINUITY EQUATION 

Equation (7.1) defines the current in a wire, i.e., along a curve. More generally 
we need to define current throughout a volume of space. For example, the space 
might be occupied by a metal or a plasma. How should we describe the current at 
a point in the material? We denote by J(x) the volume current density at the point 
x. The definition of J(x) is that if dA is an infinitesimal area at x in the volume, 
then J(x) • dA is the net charge per unit time passing through dA\ that is, 


dI=3-dA, (7.3) 

as illustrated in Fig. 7.3. In other words, 7, (x) is the current per unit area at x in 
the direction of e, . 2 The unit of current density is A/m 2 . 

If the current in a volume of space is due to charges q with volume number 
density n (= number of charge carriers per unit volume) moving with mean ve¬ 
locity v, then the current density is 


J = qn\. (7.4) 

Note that the units of qn\ are A/m 2 . Current density is the flux of electric charge. 
In general flux is equal to density times velocity. For example, in hydrodynamics 
the mass flux is the mass density times the fluid velocity. Or, another example will 

2 The suffix i = 1,2,3 corresponds to the x , y, z direction, respectively. 
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FIGURE 7.3 Volume current density. J is the current per unit area, so the charge pass¬ 
ing the area dA during dt is J • tidAdt = J t dAdt, which is equal to qnv,dAdt if the 
charge carriers have velocity v. 


be discussed later in this book when we will leant that the energy flux in an elec¬ 
tromagnetic wave is the field energy density times the velocity of propagation. 
In (7.4) qn is the charge density of the current carriers. To understand (7.4) con¬ 
sider Fig. 7.3, which demonstrates that the flowing charge contained in a volume 
of size dA ■ \dt will pass through dA during the time interval dt. That charge, 
qn(dA) ■ \dt, is by definition J • (dA)dt\ hence (7.4). 

Besides one-dimensional current (/) in a wire, and three-dimensional current 
density (J) in a volume, we also encounter two-dimensional current density (K) 
for charge flow on a surface. The definition of surface current density is that the 
charge per unit time flowing across a line segment dt on the surface is 

dl = K z L dl, (7.5) 



FIGURE 7.4 Surface current density. K is the current per unit length on the surface. 
The charge per unit time passing through a line segment dt is K • ex dt, where ej_ is 
perpendicular to dt, and lies on the surface. In the figure the direction of ex has been 
chosen such that K • ex is positive. 
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where ej_ is a unit vector lying in the surface and perpendicular to dl. The surface 
current is illustrated in Fig. 7.4. If the current in the surface is perpendicular to 
dl then dl = K dl', if the current is parallel to dl then dl is 0. The units of K 
are A/m. If a is the charge per unit area of charges moving on the surface with 
velocity v, then K = ctv. 

7.2.1 ■ Local Conservation of Charge 

The net charge of an isolated system is constant. But furthermore, charge is not 
only conserved overall in a system, it is also conserved point by point throughout 
the system. This local conservation of charge is described mathematically by the 
continuity equation 

(7.6) 

Here p(x, t) is the volume charge density (= charge per unit volume), and J(x, r) 
is the volume current density (= current per unit area). Equation (7.6) is univer¬ 
sally true, for arbitrary time dependence. 

The continuity equation (7.6) states that the rate of charge flow away from the 
point x is equal to the rate of decrease of charge at x. Recall the meaning of diver¬ 
gence from Chapter 2. By the definition (2-34), V• Jd 3 x is the flux of J through the 
boundary of the infinitesimal volume d^x. Because charge is conserved, that flux 
must equal the rate of decrease of charge inside d^x, i.e., — (9p/9f)d 3 .r; hence 
(7.6). If we integrate Eq. (7.6) over an arbitrary volume V, and use Gauss’s the¬ 
orem to convert the volume integral of V • J to the surface integral of J, then we 
obtain the integral form of the continuity equation, 

<(j-dA = -4- f pd 3 x. (7.7) 

Js dt Jv 

Equation (7.7) states that the rate of charge passing outward through the closed 
surface S is equal to the rate of decrease of charge in the enclosed volume V. 
Equation (7.6), or equivalently (7.7), is a basic equation of electrodynamics, ex¬ 
pressing local conservation of charge. 

Conservation of charge is one of the most profound laws of nature. From 
an experimental viewpoint, charge is conserved because the elementary charged 
particles—electrons and protons in ordinary matter—are themselves conserved, 
never appearing from, nor disappearing into, nothingness. In modem theories of 
the fundamental interactions, conservation of charge is a consequence of gauge 
symmetry, i.e., the fact that the dynamics is invariant under certain transforma¬ 
tions of the basic fields called gauge transformations. This abstract idea has far- 
reaching theoretical implications. 

7.2.2 ■ Boundary Condition on J(x, t) 

In field theory the primary equations are partial differential equations, which spec¬ 
ify how the fields vary in space and time. If there are boundaries in the system, 
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FIGURE 7.5 The boundary condition for J. For a steady current, V • J = 0. Then 

Jin — J\n- 


such as interfaces between different materials, then the differential equations must 
be augmented by boundary conditions. The appropriate boundary conditions are 
themselves determined by the partial differential equations. We encounter numer¬ 
ous examples of boundary conditions in field theory. For example, we have al¬ 
ready seen that the tangential components of E(x) are continuous across a bound¬ 
ary surface. Also, the normal component of D(x) is continuous across a surface 
without free charge; or, D n jumps by <Tf ree across a surface with free charge den¬ 
sity. 

In this section we derive the boundary condition satisfied by the current density 
J(x, t). Let S be a surface in space. S may be any surface—even an imaginary 
mathematical surface—but the interesting case is where S separates two different 
materials. Now consider a cylindrical Gaussian surface E, in the shape of a pill 
box of height h, cutting across S, as shown in Fig. 7.5. According to the continuity 
equation, the flux of J out through E is equal to —dQ/dt, where Q is the charge 
enclosed by E. To derive the boundary condition, let h —► 0. In the limit, the 
cylindrical side of E approaches 0 in area, so the flux through it approaches 0; and 
the top and bottom faces of E each approach a patch of area A on S, from opposite 
sides of S. Thus the flux integral approaches f A (J 2 n — J\ n )dA, where J\ n and Jj n 
denote the component of J(x, t) normal to the surface (in the direction from 1 to 
2) on the opposite sides of A. As h —> 0, the enclosed charge Q approaches the 
surface charge f A odA. Using the fact that these surface integrals must be equal 
for any area A on S, we see that the normal component of J(x, t) must satisfy 

da 

hn - hn = ( 7 - 8 ) 

If the surface charge density a on S is constant in time, for example 0, then 
the normal component of J(x, t) is continuous across S. That is a consequence of 
charge conservation: If charge flows into the surface on one side, then an equal 
amount must flow away from the surface on the other side if a remains constant. 
Conversely, if J n is discontinuous, then the difference between the current away 
from the surface on one side and that into the other side must equal the rate of 
decrease of a. 
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The argument leading to (7.8) is a good example of a derivation of the bound¬ 
ary condition for a vector function. The starting point was the partial differential 
equation V • J = — dp/dt. Because this involves the divergence, it implies a con¬ 
dition on the normal component at any surface. 


7.3 ■ CURRENT AND RESISTANCE 

We learned in Chapter 4 that a conductor in electrostatic equilibrium is an equipo- 
tential, with E = 0 inside it. However, if there is current in the conductor then it 
is not an equipotential. The current is driven by the potential gradient, i.e., by 
nonzero E in the conductor. At the end of this section we will see that for a steady 
current, as for electrostatics, the interior of the conductor is electrically neutral; 
any excess charge must reside on the surface. Charge moves around inside the 
conductor in response to the electric field but p — 0 if the current is constant in 
time. 

7.3.1 ■ Ohm's Law 

How is the current related to the potential gradient in a conductor? If two terminal 
points on a conductor are held at a constant potential difference V, e.g., by con¬ 
necting them to the electrodes of a battery, then in equilibrium a steady current 
flows through the conductor. Let I be the total current at either point, i.e., the inte¬ 
grated flux / J• dA through a surface inside the conductor surrounding the point. 3 
It is found empirically that for many cases the current and potential difference are 
proportional, 


V = IR. (7.9) 

Equation (7.9) is Ohm’s law. 4 5 6 The constant of proportionality R is called the 
resistance of the conductor. The SI unit of resistance is the ohm (£2), defined 
by 1 £2 = 1 V/A. The reciprocal 1 /R is called the conductance, and its unit is 
£2 -1 , or siemens (S). 5,6 Ohm’s law holds to a very good approximation for many 
conductors. However, it is not a universal principle, and in Sec. 7.7 we’ll see an 
example where it does not hold. 

The resistance I? of a sample of matter is a function of the geometry (size and 
shape) of the sample, and of the material composition. For example, the resistance 
of a uniform cylinder, of length L and cross section A, is proportional to L and 
inversely proportional to A, 

3 The surface integral does not include the wire from the battery. For a closed surface integral, which 
would include the cross section of the wire, <f J ■ dA = 0 by conservation of charge. 

4 Georg Simon Ohm, b, Erlangen in 1789, d. Munich in 1854. 

5 William Siemens, b. Hannover in 1823, d. London in 1883. 

6 In some places the unit of conductance is called by the cute name “mho” for 1/ohm. Honoring Heir 
Siemens is preferred. 
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TABLE 7.1 Resistivity p for some materials 
at 0° C or at room temperature 


Material 

Resistivity p (£2 m) 

pure metals a (0° C) 

Ag 

1.47 x 10" 8 

Cu 

1.54 x 10" 8 

A1 

2.43 x 10" 8 

Be 

2.71 x 10" 8 

W 

4.82 x 10" 8 

Zn 

5.59 x 10" 8 

Fe 

8.64 x 10" 8 

Pt 

9.8 x 10" 8 

Cs 

18.0 x 10" 8 

Pb 

19.2 x 10" 8 

Ti 

45 x 10~ 8 

Hg 6 (liq., 20° C) 

95.8 x 10" 8 

Bi 

127 x 10" 8 

ff-Mn 

137 x 10" 8 

sea water 

0.21 

semiconductors (20°C) 

Ge 

~ 0.5 

Si 

~ 2300 

insulators 

wood 

10 8 to 10 11 

glass 

10 10 to 10 15 


a S. Bass, Landolt-Bomstein Volume 15. Metals: 
Electronic Transport Phenomena (Springer-Verlag, 
Berlin, 1982). 

b American Institute of Physics Handbook, 2nd ed. 
(McGraw-Hill, New York, 1963). 


R = pL/A. (7.10) 

The parameter p (not to be confused with charge density p(x)!) is an intrinsic 
property of the material called the resistivity. Table 7.1 lists the approximate re¬ 
sistivities of some materials. Metals, semiconductors, and insulators have vastly 
different values of resistivity; the mechanisms of charge transport are differ¬ 
ent for these different classes of materials. Typical ranges of resistivity are: 
p (pure metals) 10 _8 f2m, p(alloys) ~ 10 -6 fim, p(semiconductors) « 

10 -5 — 10 3 £2m, and p(insulators) ~ 10 8 — 10 17 £2m. We also define the con¬ 
ductivity a of the material by o = 1 /p; the units of a are (Q m) _1 , or S/m. 



230 


Chapter 7 Electric Currents 


Table 7.1 illustrates the full range of p for pure metals, which is only two 
orders of magnitude. The table also shows that resistivities of insulators are much 
higher than those of metals; indeed the common definition of an insulator is that 
the material does not conduct electricity. The microscopic reason for large p is 
that the energy required to move electrons from bound states in the insulator’s 
atoms, into states in which they can move through the insulator, is very large, so 
that essentially no electrons are available to transport charge. 


Local Form of Ohm's Law 

We may also write a local form of Ohm’s law, which is a more basic equation than 
(7.9): 


J(x) — ctE(x). (7.11) 

Again, a is the conductivity. We will often use a rather than p to express the 
material property; to use Table 7.1, a — 1/p. This proportionality between the 
current density J and the electric field E, point by point in the material, holds to 
a good approximation throughout uniform isotropic conductors. Equation (7.11) 
is not universally true, but rather an empirical fact for many isotropic materials. 
A physical model that suggests why J should be proportional to E in simple ma¬ 
terials will be explored in the next section. In crystals it is found experimentally 
that an electric field in one direction can cause current in another direction; in this 
case the conductivity a must be generalized to a tensor quantity. 


Resistance Calculations 

We can use the local form of Ohm’s law (7.11) to calculate the resistance of a 
sample of matter with a given geometry. For example, consider a uniform cylinder 
of length L and cross section A, with a potential difference V between the ends. 
The electric field in the cylinder is £ = V / L, and the current is I — J A — oE A. 
Thus the resistance is 



L 

o A’ 


which is the relation (7.10) stated earlier. 


(7.12) 


EXAMPLE 1 Consider a cylindrical resistor with a resistivity p(z.) that varies 
along the length as shown in Fig. 7.6. The resistant material occupies the re¬ 
gion 0 < z < h and the cross section is A. Above and below the resistor is a 
perfect conductor (p — 0). We’ll calculate the total resistance, and the net charge 
distribution of the resistor if there is a steady current I. 

The resistor can be subdivided into slices of height dz, and the resistance of 
each slice is dR — p(z.)dz./A. Because the current flows uniformly through the 
slices in series, the total resistance is the sum of the resistances of the slices, 
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FIGURE 7.6 Example 1. A cylindrical resistor with resistivity p (z) that varies along its 
length. 


R = I dR = ~J^ p(z)dz. (7.13) 

If the resistivity is a constant po, then R = poh/A. 

Inside the resistor, and on the top and bottom faces, we can determine the 
interesting charge distribution using the methods of Chapter 6. The current is 
uniform so J(x) = Ik/A. Then the electric field is 7 


E(x)=^-k. (7.14) 

A 

Assuming the permittivity e is constant, the free charge density in the volume of 
the resistor is 

Pfree(x) = eV • E = ^ . (7.15) 

The surface charge densities on the boundaries at z = 0 and z = h are 


°free — 


+e£ z (0) = ~-p(0) for z = 0 

A j 

-eE z (h) = - e —p(h) for z = h 


(7.16) 


Thus a positive charge exists on the upstream boundary surface, e.g., at z = 0 
if I > 0, and a negative charge exists on the downstream surface. However, the 
total charge on the resistor is 0 because the integrated volume charge, which is 
e 7 [p(h) — p(0)], is equal but opposite to the net charge on the surfaces. 


7 Note that all field equations and boundary conditions are satisfied by this solution: The curl of E is 0, 
E is normal to the equipotential surfaces at z = 0 and z = h, and J is tangential on the cylinder. 
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EXAMPLE 2 What is the resistance for a radial current between concentric con¬ 
ducting spheres with radii a and b, if there is a material with conductivity a be¬ 
tween the spheres? 

We’ll solve this problem in two different ways. First, using the geometrical 
method of the previous example, divide the conducting volume into spherical 
shells of radius r and thickness dr, and treat the shells as resistors in series. The 
resistance of the shell at r is d R = dr/(a Am 1 ). The total resistance is f dR\ 
that is, 


1 

47TCT 



(7.17) 


Second, calculate V/I for a given potential difference V between the spheres. To 
have the potential difference V we know from electrostatics (Gauss’s Law) there 
must be a charge Q on the inner sphere, with 


V = 


4;reo \« b) 


(7.18) 


The current is obtained by integrating J, which is a E, over any concentric sphere 
in the conducting region 


I = 




aE ■ dA 


°Q 
eo ’ 


(7.19) 


where the second equality follows from Gauss’s Law. The resistance R — V/I is 
again (7.17). 


Now is a good time to derive an interesting general relation connecting the 
capacitance C of two conductors and the resistance R between them. Consider 
Conductors 1 and 2 shown in Fig. 7.7. A potential difference V has been set up 
between them, say by connecting and then disconnecting a battery, so that they 
have charges ±Q, where C = Q/V. The space around the conductors is filled 
with a material of permittivity e and conductivity a . Some lines of E are also 
shown on the figure. Current I = V/R will flow from 1 —> 2 along the lines of E. 
Suppose a is small so that I is small. 

Now consider the Gaussian surface S, shown dashed in Fig. 7.7, which com¬ 
pletely surrounds Conductor 1. Just after the battery is disconnected we may write, 
using J = ctE and V • E = pf/e, 

I = (j) J ■ hdA = a <j> E ■ iid A = a J V ■ Ed 3 x = — j pidr’x = —Q. (7.20) 
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FIGURE 7.7 Two conductors in a conducting medium. The capacitance and resistance 
are related by (7.21). S is a Gaussian surface. 


Going from the third expression to the fourth we’ve used Gauss’s theorem. If we 
now substitute / = V/R and Q = CV in the first and last terms of (7.20) we find 
a relation between C and R, 

RC = —. (7.21) 

a 

Equation (7.21) is very interesting because it connects two quantities R and C 
that depend in a complicated way on the configuration (shape, size, separation) of 
the two conductors. The physics underlying this connection is that the same E(x) 
determines both the current and the potential difference. 

Notice that both sides of (7.21) have dimensions of time. We will see in Section 
7.6 that the ratio e/cr is the time constant for decay of a charge concentration in a 
conductor, and the product RC is the decay time for a series RC-circuit. Also, R 
varies inversely with a, and C is proportional to e. Equation (7.21) has practical 
implications too. It shows that any real capacitor has a finite leakage resistance 
because a / 0, and any real resistor has a finite capacitance because e / oc. 
These nonideal properties of real circuit devices obey (7.21). 

7.3.2 ■ Fabrication of Resistors 

A resistor is a circuit element with a specified value of resistance. Resistors are 
found in all practical electric circuits, with resistances ranging from a few £2 to 
kQ, or even Mfi. These resistances are much larger than those in the wires or 
contacts in the circuit. 

Three common types of resistors are wire-wound, thin film, and composition. 
A wire-wound resistor is a long, very fine wire wound on an insulating support. 
The resistance, pL/A, can be predicted precisely, so this type is useful where an 
accurate resistance value is important. The wire is usually a metal alloy, such as 
manganin or Constantin. These materials have resistivities almost independent of 
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temperature, so such resistors can be used in applications where the Joule heat 
is so large that it raises the temperature significantly. In a thin-film resistor, the 
conducting path is a thin film of metal on a cylindrical insulating support. The 
resistance is inversely proportional to the thickness of the film. The advantage of 
this design is that the inductance is small, whereas a long wire, wound in coils, 
would have a large inductance. Thin-film resistors are used in high-frequency ap¬ 
plications, where inductance would produce undesired effects. A composition re¬ 
sistor consists of graphite (C) granules pressed together and held in an insulating 
glue. Graphite is a semimetal, i.e., a conductor with high resistivity. Composition 
resistors are often used in applications requiring large resistance. 

7.3.3 ■ The Surface Charge on a Current Carrying Wire 

Picture a wire whose ends are connected to the electrodes of a battery. The wire 
will carry a constant current because its ends are held at different potentials main¬ 
tained by the chemical emf of the battery. Now consider a local view: The current 
is distributed over the cross section of the wire and is related to the field by the 
local version of Ohm’s law, J = ctE. There is a local field E everywhere in the 
wire where current exists, and it is fair to say that this E drives the charges in the 
current. Now, what is the source of E? This may seem like a simple question, but 
the answer is surprisingly difficult. 

Because there is no time dependence, the source of E can only be a charge 
distribution. But where can the charges be? Inside the wire the charge density 
must be 0. To see this, note that in the steady state the rate of charge entering 
one end of the wire equals the rate of charge flowing out the other end, so that 
no charge is accumulating in the wire. This means that dp/dt — 0 in the wire, 
and so V • J = 0 by the continuity equation. Then by the local form of Ohm’s 
law, V • E = 0 in the wire, and so p = 0 by Gauss’s law, as claimecf. Thus the 
charge producing E is not inside the wire. Outside the wire could be vacuum, so 
the charge isn’t there. All that remains is the surface. The answer to the questions 
we have posed is that there is in general a surface charge distribution on the wire. 
This surface charge determines E, by Gauss’s Law, both outside and inside the 
wire, and hence the current in the wire. 8 

Calculating the surface charge distribution is generally difficult. The reason 
is that it depends sensitively on the geometry of the entire circuit including the 
battery. A few ideal models can be solved analytically, involving for example 
coaxial conductors for the current path and parallel plates or a “spherical battery” 
for the emf. 9 

Although the calculation is difficult for us, nature has no apparent difficulty 
in solving even complicated boundary value problems of this kind. After all, the 

8 J. D. Jackson, Am. J. Phys. 64, 855 (1996). Jackson describes three roles of the surface charge: “(1) to 
maintain the potential around the circuit, (2) to provide the electric field in the space around the circuit, 
and (3) to assure the confined flow of current.” 

9 N. W. Preyer, Am. J. Phys. 68, 1002 (2000). Preyer reviews the history of this interesting subject. 



7.3 Current and Resistance 


235 


wire can be bent into a hairpin, or even tied in a bow, and immediately nature 
arranges the surface charge such that its E maintains the current J = cjE confined 
in the wire. 10 Although the net charge on the surface of a current-carrying wire 
is small, because the capacitance is small, e.g., C ~ 1 pF/m for household lamp 
wire, it is important to the phenomenon. 

The field inside a wire. To find the field inside a current-carrying wire, it is 
necessary in general to solve a difficult boundary-value problem. However, an 
ideal straight wire with circular cross section can be analyzed as follows. Let the 
z axis be the location of the wire, so that by translation invariance the current 
density has the form J = / z (r)k. The field is electrostatic so V x E = 0; then 
because J = ctE, also V x J = 0 in a uniform conductor. Evaluating the curl 
in cylindrical coordinates gives V x J = — ( 8J z /dr) = 0. Therefore J z is 
independent of r, i.e., the current must be uniformly distributed on any plane 
orthogonal to the z axis. It follows that E = £ z k is uniform in the wire, which is 
the solution to the problem. But this ideal straight wire is only a prototype model. 
For a curved wire, J and E will not be strictly uniform, although they will be 
approximately uniform if the radius of curvature is large compared to the radius 
of the wire. 


The Capacitance of a Wire 

In Chapters 3 and 4 we discussed the properties of conducting ellipsoids, and we 
can use the results obtained there to derive the capacitance per unit length of a 
wire. The idea is to approximate the wire by a long thin ellipsoid. The result of 
Exercise 24(b) of Chapter 4 gives the potential Vo of a conducting ellipsoid whose 
total charge is Q. From the result, the capacitance with respect to potential zero 
at infinity is C = Q/ Vo, given by 


8jreoVc 2 — a 2 

C = — n - \ , , ■ (7.22) 

In J + Vc 2 — a 2 j / ~ Vc 2 — a 2 j J 

in terms of a and c, the semiminor and semimajor axes. 

Now, for a long thin ellipsoid, a <£ c; so the argument of the logarithm may 
be expanded as {...) ~ (2 c/a) 2 . We may then write 


C ^ 2.71 60 
L In (L/a)’ 


(7.23) 


where L, the length of the wire, is approximated by 2c. For a wire of length 
L = 20 cm with radius a = 0.5 mm, (7.23) gives C' = 6pF/m, in agreement with 
the estimate used earlier. 

10 “Nature laughs at the difficulties of integration.” —P.-S. Laplace 
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7.4 ■ A CLASSICAL MODEL OF CONDUCTIVITY 

Ohm’s law is not a universal principle but it is, for many materials and for a 
wide range of currents, a very good approximation. Why is Ohm’s law valid? We 
can gain some insight into electrical resistance by examining a simple theoretical 
model of conduction, developed by Drude in 1900, based on classical mechanics. 
Let n be the density of mobile charge carriers in a material, q the charge of each 
carrier, and (v) the mean velocity of the carriers, so that the current density is 
J = qn{\), as in (7.4). 

In a metal the charge carriers are conduction electrons moving at a character¬ 
istic speed and undergoing frequent collisions with microscopic scatterers. (This 
speed is called the thermal speed for historical reasons. We will see below that it 
is characteristic of the metal rather than the temperature.) The velocity of any spe¬ 
cific electron changes frequently because of the collisions. If there is no current, 
then the electrons undergo random motion with no net drift. If, however, an elec¬ 
tric field is applied, there will be a current because the electrons will then move 
by a combination of random walk and a slow drift with the mean velocity (v) in 
the direction of —E. The motion of a conduction electron is illustrated in Fig. 7.8. 

Let A be the mean free path between collisions of the electrons. Then the mean 
time r between collisions is A/u t h, where u t h is the thermal speed, which is much 
larger than the drift speed | (v) |. 11 Now consider a certain electron. Just after a 
collision, the electron has velocity vo. The average of vo over many collisions is 
0, because the collisions randomize the direction of motion. Between collisions 
the electron has acceleration a = # E / ra. Therefore its average velocity between 
collisions is of order ar, and we do not need to be more precise than this for our 
estimate of (v). Substituting this drift velocity into (7.4), we find the proportion¬ 
ality J = <rE, where the conductivity is 



FIGURE 7.8 The Drude model. An electron in a current moves by a combination of 
random walk, and drift in the direction opposite to E. The figure shows a random path in 
two dimensions, but in a real metal the motion is a random path in three dimensions. 

n 


See Exercise 1. 
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nq 2 A. 

a =-. 

mv * 


(7.24) 


Although this model of electric conductivity is only heuristic, it does give a rea¬ 
sonable picture of resistance. For example, it predicts that conductivity o de¬ 
creases as temperature increases, which is Irue for most metals at room temper¬ 
ature, although not for semiconductors. For most simple metals the temperature 
coefficient of resistivity, defined by 



is approximately constant and equal to 0.004 K -1 . 

A characteristic property of semiconductors is that, in contrast to metals, their 
conductivity increases with increasing temperature. The microscopic reason is 
that in semiconductors the energy gap between the bound electronic states in the 
atoms and the conducting states is small enough that as temperature increases, 
electrons are freed from the atoms and thus the density of charge carriers in¬ 
creases; this is in contrast to insulators, as we learned earlier. In condensed matter 
physics one says that electrons in semiconductors are promoted across the energy 
gap between the top of the valence band and the bottom of the conduction band 
by increasing temperature. In the commonly used semiconductors, Ge, Si, InSb, 
and GaAs, that energy gap ranges from 0.2 to 1.4 eV. 

It is appropriate here to discuss our current knowledge of the parameters in 
(7.24). First, the density n of conduction electrons ranges for common metals 
from about 0.9 x 10 28 m -3 for Cs to 25 x 10 28 m -3 for Be; for Cu at room 
temperature n = 8.47 x 10 28 m -3 . Second, there is an interesting history and 
physics associated with uth. Drude’s theory (1900) predated quantum mechanics. 
Classically u t h would be calculated from the equipartition theorem 

5mrj, = \kT (7.25) 

where k is Boltzmann’s constant and T is, say, room temperature. But we now 
know that electrons in a metal do not behave classically, but must be treated by 
quantum mechanics. Because of the Pauli exclusion principle, conduction elec¬ 
trons occupy states that correspond to velocities from 0 up to the Fermi velocity 
v p, defined by 


\mv 2 F = kTp (7.26) 

where 7> is the Fermi temperature. The relevant electron speed in (7.24) is vp 
rather than the classical thermal speed u t h- 7> is much higher than room temper¬ 
ature; for example, Tp is 81,600K for Cu. Therefore the electron speed is ten 
times larger than the classical value, and so the corresponding mean-free path 
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deduced from (7.24) is ten times longer than in the classical model. 12 Finally, 
we note that precise calculations of the mean-free time r, a parameter implicit in 
(7.24), depend on details of the scattering mechanisms for the conduction elec¬ 
trons. Scattering occurs from phonons (quantized lattice vibrations), impurities 
and other lattice imperfections, polycrystalline boundaries, etc. Such calculations 
play an important role in modem theoretical studies of conduction in metals. 


7.5 ■ JOULE'S LAW 

Resistance is a kind of friction. The current carriers in a resistor, e.g., conduction 
electrons in a metal, lose energy by collision with atomic-scale scatterers. The 
energy is transferred to lattice vibrations as heat. We may determine the power 
dissipated in the resistor, as a function of the current I and resistance R. The 
work per unit time done by E as charge passes through the potential difference V, 
is IV . By energy conservation this must equal the power P dissipated as heat 

P = IV = I 2 R, (7.27) 

a result known as Joule’s law. The relation P = I 2 R was discovered by Joule in 
1841, by experiment. Table 7.2 gives some typical examples of current and power 
for everyday devices. 

We may also write a local expression for the power dissipated in resistance, in 
terms of the current density J(x) in the conducting material. The work per unit 
time done on an isolated charged particle q is F • v = qE ■ v, where v is the 
particle velocity. This is the power supplied to q. Similarly, the power supplied to 
all mobile charge dQ in a volume element d 2 x is 

dP = dQv ■ E = p(x)v • E d 2 x r AJ. 28) 

where v is the mean velocity of the charge carriers, and p(x) their charge density. 
But p(x)v is the current density J(x), so the power supplied to dQ is dP = 
J • E d 2 x. That is, the local power per unit volume is J(x) • E(x), and the total 


TABLE 7.2 Currents and Powers 


Device 

Current (A) 

Power (W) 

light emitting diode 

2 x 1(T 2 

10 -2 

electric shaver 

8 x 1(T 2 

10 

table lamp 

0.8 

100 

hand iron 

8 

1000 

starter motor of a car 

180 

2200 

central air conditioner 

20 

5000 


12 


See Exercise 12. 
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power is 


P = J J • Ed 3 x = J pJ 2 d 3 x. (7.29) 

(The p in (7.29) is the resistivity, not the charge density!) Equation (7.29) is the 
generalization of (7.27). Another form of this equation that we will use later, is 
that the power density is cr E 2 . 

Because energy is dissipated in resistance, a source of energy is necessary to 
maintain a current in a resistor. For example, a battery is a chemical reactor that 
maintains a constant potential difference V between its electrodes. The chemical 
reaction is the energy source. If the battery is used in a circuit with resistance R, 
then the ideal lifetime of the battery is UoR/V 2 where Uo is the initial stored 
chemical energy. 

If there is no energy source in a circuit with resistance, then the current must 
decay. What would happen to the current in a battery-driven circuit if the battery 
were replaced by a capacitor with the same voltage? The current would decay to 
0 . 


7.6 ■ DECAY OF A CHARGE DENSITY FLUCTUATION 


In electrostatics the net charge density inside a conductor is 0; in equilibrium 
any net charge of the conductor resides on the surface. But suppose there is at 
some initial time a nonzero charge density po(x) in an isolated conductor with 
conductivity a. How long does it take to reach equilibrium, i.e., for the charge 
density to decay? We will answer this question by finding p(x, t). To begin with, 
we assume that the polarization is negligible in the conductor, so € — eo- 

The charge density decays as charge flows to the surface. According to the 
continuity equation, dp/dt — — V • J. In an ohmic conductor, J is cjE. Also, V • E 
is p/e o by Gauss’s Law. Taken together these equations imply 


dp a 

3 1 eo P 


(7.30) 


The solution to this differential equation is 

p(x, t) = p 0 {x)e~ t/T . (7.31) 


The charge density decays exponentially with time constant r = €q/o . For an 
ideal conductor, i.e., <7 = 00 , the decay time would be 0. For metals r is very 
short, and for insulators it is very long. We leave it as an exercise for the reader to 
substitute physical values, and to calculate r for various materials. 13 

13 


See Exercise 14. 
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Energy is conserved, so the total energy dissipated as heat and emitted as radi¬ 
ation while the charge moves to equilibrium must equal the decrease in the total 
electrostatic energy initially associated with the charge density po(x). At equi¬ 
librium the excess charge resides on the surface of the conductor. The energy 
dissipated thermally is the integrated Joule power, 

POO pOO p 

/ Pdt — / / oE 2 d i xdt. (7.32) 

Jo Jo J 

The spatial integral is over the volume of the conductor. By Gauss’s Law, E(x, t) 
must decay with the same time dependence as p(x, ?), so E = Eoe“'/ r where 
again r = eo/a and Eo is the initial field. Integrating over the time in (7.32) gives 

jf Pdt = J Q J crE^e~ 2,/T d 3 xdt = y J E%d 3 x (7.33) 

which is indeed the initial field energy inside the conductor. At equilibrium there 
is no field energy inside the conductor because the internal field is zero. But we 
must also consider the electrostatic energy outside the conductor. In general the 
external field changes because the charge distribution changes, but (7.33) shows 
that this change does not contribute to the dissipation by heat. Rather the external 
field energy decreases because of radiation. In Chapter 15 we will study radiation 
from time-dependent currents, of which this is an example. 

In our calculations we neglected polarization in the conductor. This is a good 
approximation for metals at low frequencies. For a static system the polarization 
in a conductor is exactly 0 because E = 0. But in the presence of E and J, the 
ions in the crystal lattice may be polarized, leading to bound charge. How are the 
results different if the permittivity is e? In this case Gauss’s Law is V - E = Pf/e, 
where pf is the density of free charge, i.e., conduction electrons. Again J = a E, 
and here J is the current of the conduction electrons. The only change-is to replace 
eo by e, so the decay time constant is r = e/o'. 


EXAMPLE 3 As another example of the decay of a charge distribution, consider 
the RC circuit shown in Fig. 7.9. With the switch as shown, the charge on the 
capacitor is Qo = CVo- Suppose the switch is flipped at t = 0. Then the charge 
at time t is 14 


Q(t) = Qoe~ ,/RC ; (7.34) 

so Q decays exponentially with time constant RC. The power dissipated in the 
resistor is I 2 R, where by charge conservation I = dQ/dt. The total energy dis¬ 
sipated is 

14 


See Exercise 15. 



7.7 


l-V Characteristic of a Vacuum-Tube Diode 


241 



r'--r(i) 2 —i <7j5> 

which is the initial energy stored in the capacitor. 


7.7 ■ l-V CHARACTERISTIC OF A VACUUM-TUBE DIODE 


A vacuum-tube diode consists of a heated cathode at ground potential and an 
anode at positive potential Vo. Electrons are emitted by the cathode and accelerate 
to the anode. The potential difference Vo controls the net current. This seems like 
a very elementary example of electric current, but we will see that it has some 
interesting complexity. We will analyze the steady state of this simple system, 
with charge and current densities independent of time. 

The electrodes are shown in Fig. 7.10. The plates have area A and separation 
d, and are perpendicular to the x axis at x =0 and x = d. To make the calcu¬ 
lation tractable we assume d <<C y/~A so that edge effects are negligible. Then the 
variables of the system depend only on the coordinate x. The charge density and 
current density are p(x) and J = i J(x)\ the potential is V (x). 

The current is due to motion of the electrons, so we need to determine the 
electron velocity as a function of position x in the gap. At the cathode the electrons 
have small velocity, which we take to be 0. They accelerate through the gap to 
the anode. At position x the electron velocity is v = i v(x). By conservation of 
energy, jtnv 2 (x) = eV(x), so the velocity at x is 


v(x) = 


[ 2eV(x) 


1/2 


m 


(7.36) 


We don’t know a priori the potential function V(x). If there were no charge be¬ 
tween the plates, then V (x) would just be Vox / d. But the electron charge also 
contributes to V(x). We need some other equations to determine the potential as 
a function of x. 
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FIGURE 7.10 Cathode and anode of a vacuum tube diode. Electrons boil off the hot 

cathode and accelerate in E to the anode. 


Before we start writing the equations, let’s try to understand the principles in 
physical terms. First, the density p(x) and electron velocity v(x) determine the 
current in the gap. Second, in the steady state the current must be the same through 
any cross section of the gap; otherwise the amount of charge between two cross 
sections would be changing, which is not the steady state. Finally, the potential 
V (x ) throughout the gap is determined by the charge density p(x), by Poisson’s 
equation. These principles, together with the relation (7.36) between the speed 
and potential, imply a unique solution for p(x), V(x), and J(x). 

We are now ready to analyze the equations. Note first that the current density 
J (x) is p(x)v(x). Conservation of charge is expressed by the continuity equation 
(7.6). In the steady state dp/dt is 0; and for this one-dimensional current V • J is 
dJ/dx. Thus the continuity equation reduces to the statement 



(7.37) 


i.e., J (x) is a constant J in the gap. (Note that J is negative: The current in 
Fig. 7.10 is due to electrons moving to the right, i.e., the positive x direction, so 
J = J x is negative.) Combining (7.36) and the equation J = p(x)v(x) gives one 
relation between p(x) and V(x), 


p(x) = ^- = -C[V(x)]-' /2 
v(x) 


(7.38) 


where C is a positive constant, defined by C = — J[m/(2e)]^ 2 . 

Another relation between V(x) and p(x) is Poisson’s equation, — V 2 V = 
p/eo. For this one-dimensional system, V 2 V is just d 2 V/dx 2 . Then using (7.38) 
the potential function must satisfy 


d 2 V 

dx 2 


e o 


(7.39) 
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This ordinary differential equation can be solved by conventional methods. We’ll 
solve it by guessing that the solution is a power of x. We must also satisfy the 
boundary conditions, V (0) =0 and V(d) = Vo, so we write the trial solution as 

v(x)=v °(W (7 - 40) 

where the power p remains to be determined. Substituting this form into (7.39) 
we find 


^p(p - \)x?- 2 = - ^*- p/2 . (7.41) 

dP eo Vq' 

The equation is satisfied for 

4 4e 0 V 0 3/2 

P= 3 “ d C = (7 ' 42) 

Our first result is that V(x) is proportional to x 4 / 3 . The charge density in the gap 
is now determined by (7.38) 


4eoVb 


9 d 2 



(7.43) 


Note that p(x) is proportional to x 2 / 3 . 



x/d 



x/d 

FIGURE 7.11 V (x) and p (x) in a vacuum tube diode, po is the coefficient of (d/x) 2 / 3 
in (7.43). 
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Figure 7.11 shows plots of V( x ) and p(x) as functions of x. The charge in the 
gap is called space charge. If the space charge is initially 0, then as electrons are 
emitted by the cathode a charge density will build up in the gap until (7.38) is 
true. This final state—the steady state—is a state of dynamic equilibrium. 

The total current through any cross section of the gap is I = J A. According 
to the solution (7.42), remembering that J = —C(2e/m )’/ 2 , the current is 


I = - 



,,3/2 
9 d 2 0 


(7.44) 


This current is said to be “limited by the space charge,” because the potential 
gradient near the cathode is less than it would be if there were no space charge. In 
fact, the electric field approaches 0 at the cathode. 

For this elementary device, Ohm’s law is not valid; i.e., I is not proportional to 
Vo- That is not surprising. Ohm’s law is associated with dissipation by collisions, 
but in this vacuum tube there are no electron collisions. The characteristic IV 
dependence I oc V Q is called the Child-Langmuir law. It is common for currents 
limited by space charge. 


Ion rockets. An interesting application of charged particle emission with accel¬ 
eration by an electrostatic field is the ion rocket. Ion rockets have been used for 
three decades as small thrusters in satellites, e.g., to control orientation of a satel¬ 
lite. The basic principle of an ion rocket is that electrostatic acceleration of heavy 
positive ions produces a reaction force on the satellite. If a positive space charge 
were to build up, then the thrust would be reduced. Therefore electrons are also 
emitted to neutralize the space charge. 

Diodes. A diode is a circuit element in which current can flow in only one direc¬ 
tion. Diodes are used in rectifiers and filters. A simple diode acts as a half-wave 
rectifier: If an alternating current is the input to the diode, then the output current 
is just the positive half of the input current. 

Before the development of solid-state electronics, vacuum-tube diodes of the 
kind described above were common. In such devices a hot cathode emits electrons 
that accelerate to the anode. The electrons can only flow in one direction—from 
cathode to anode—and the current is in the opposite direction because the electron 
charge is negative. Figure 7.12 shows the /V curve for a vacuum-tube diode, 
plotting the magnitude of I. I is 0 for V <0, and | /1 oc V 3 ^ 2 for V > 0. 

The system in Fig. 7.10 is an idealized model to explain the physics. In prac¬ 
tice, the cathode is a small cylindrical metal surface with a (red-hot) heating fila¬ 
ment enclosed. The anode is typically a concentric cylindrical surface. The reason 
for heating the cathode is so that electrons come off the metal even though the field 
is zero at its surface, as we found. To facilitate electron emission the cathode is 
coated with an oxide (BaO or SrO). A typical potential for the anode (often called 
the plate) in the conducting mode is Vo = +10 to +100 volts with respect to the 
cathode. The anode is cold and uncoated so that it does not emit electrons even if 
the sign of Vo is reversed. 
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The pressure in a vacuum-tube diode is 10~ 6 Torr or less. Under these condi¬ 
tions the buildup of negative space charge decreases the field at the cathode, as 
we have seen, and inhibits electron flow. The electron space charge is reduced 
in a gas-filled thermionic diode, in which a low pressure of Hg vapor is present 
between the cathode and anode. Then if Vo is greater than the ionization potential 
of Hg (10.4 V), energetic electrons ionize Hg atoms, and the resulting Hg + ions 
move to the cathode, tending to neutralize the negative space charge and enhance 
the electron flow. 

Vacuum-tube diodes have long ago been replaced in most applications by 
semiconductor pn-junction diodes. These solid-state devices are smaller, cheaper, 
more rugged, and require less power (there is no heating filament) than vacuum- 
tube devices. The physics of semiconductors is essentially quantum mechanical. 
The idea is that pure semiconductors, Group IV elements such as Si and Ge, have 
a low density of conduction electrons and therefore do not conduct well. Their 
conductivity can be increased by adding a small concentration (parts per million) 
of a Group V element such as As or Sb. The added donor atoms donate electrons 
to the semiconductor lattice, increasing the conductivity; such a material is called 
an n-type doped semiconductor. 15 The conductivity of a pure semiconductor can 
also be increased by adding a small concentration of a Group III element such as 
A1 or In. In the resulting p-type doped semiconductor the added acceptor atoms 
accept electrons from the lattice and the conductivity is increased because there is 
an increase in the concentration of holes, states from which electrons have been 
removed. 

At a pn-junction, i.e., the boundary between an n-type region and a p-type 
region, the electrons and holes diffuse across the boundary into, respectively, the 
p-type and n-type region. The reason a pn-junction makes a good diode, so that 
such diodes are widely used as rectifiers, comes from the following properties 
of the junction: If an external voltage is applied across the pn-junction in the 
forward bias direction, i.e., with the p-region positive and the n-region negative, 
then high current exists across the junction, in the direction from p to n, with a 

15 n-type and p-type refer to negative or positive charge carriers, respectively. 
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low voltage drop. This is analogous to the situation in a vacuum-tube diode, in 
which the anode is positive with respect to the cathode. If, however, the external 
voltage is applied in the reverse bias direction, i.e., with the p-region negative and 
the n-region positive, charge cannot flow across the junction, as in a vacuum-tube 
diode with the anode negative with respect to the cathode. Semiconductor pn- 
junction diodes in use today range from those in integrated circuits in consumer 
electronics, which carry currents of a few /xA and are a few /xm in diameter, to 
those in large industrial power rectification applications, which carry currents of 
thousands of amperes and have diameters of order 10 cm. 


7.8 ■ CHAPTER SUMMARY 

Three concepts introduced in this chapter will be very important in later chapters. 
The reader should know and understand these equations: 

1 . The definition of current density, dl = J ■ dA. 

2. The continuity equation, V J = —dp/dt. 

3. The two versions of Ohm’s law, J(x) = ctE(x) and V = IR. 

It is a useful exercise to sketch drawings that explain the meanings of each of 
these equations. 


FURTHER READING 

The physics of resistance is an important topic in solid-state physics. To leant more, see 
C. Kittel, Introduction to Solid State Physics, 7th ed. (Wiley, New York, 1996). 
Lightning is a dramatic example of electric current that everyone has observed. It is de¬ 
scribed in The Feynman Lectures, Volume II, Chapter 9. 


EXERCISES 

Sec. 7.1. ID Currents 

7.1. Estimate the mean drift velocity of a conduction electron in copper, assuming that 
there is one conduction electron per atom. Consider No. 14 wire (diameter 0.16 cm), 
common in houses, carrying 5 A. The result should be very small. Why then does 
an electric light turn on almost instantly when you flip the switch? Also, compare 
the drift velocity to the Fermi velocity vp = 1.57 x 10 8 cm/s forCu, which is a 
characteristic speed of the electrons’ random motion. 

7.2. A charged ring rotates about its axis. What is the current? Let the ring have radius 
a, linear charge density X, and angular speed co. 



Exercises 


247 


Sec. 7.2. Volume Current 

7.3. A solid sphere of radius a has total charge Q uniformly distributed throughout its 
volume. The sphere rotates with angular velocity w = cuk. Find the current density 
J(x). Use spherical polar coordinates. 

Sec. 7.3. Ohm’s Law 

7.4. Explain why the resistance of two resistors in series is ??i + f? 2 , and why the resis¬ 
tance of two resistors in parallel is R\ ?? 2 /(/?i + Rl)- 

7.5. A wire has resistance R. Determine the resistance of a wire of twice the length and 
half the diameter, made from the same material. 

7.6. What is the resistance of a copper wire with length 10 m and diameter 1 mm? 

7.7. A conducting sphere of radius a is imbedded in a large volume filled with a material 
with electric conductivity a. The sphere is held at potential Vo with respect to the 
boundary, which may be approximated as spatial infinity. Determine the current den¬ 
sity J(x), the total current /, and the resistance. Verify that the continuity equation 
is satisfied. 

7.8. Two small conducting spheres with radii a and h are imbedded in a medium of 
resistivity p and permittivity e with their centers separated by a distance d a, h. 

(a) What is the resistance R between them? Notice that the result is independent of 
e. 

(b) If a = b = 1 c m and d = 10 m, and the medium i s sea water, what i s the value 
of??? 

7.9. Consider two coaxial conducting cylinders with radii a and 3 a and length L. The 
region a < r < 2a is filled with a material with conductivity a\, and the region 
2a < r < 3a has conductivity <72 • (Assume e\ = e 2 — e o) The inner cylinder is 
held at potential Vo and the outer cylinder at V = 0, so there is a radial current /. 

(a) Determine the resistance. 

(b) Determine the surface charge density on the boundary at r = 2a. 

7.10. A parallel-plate capacitor with plates of area A and separation d is imbedded in a 
large volume filled with a material with electric conductivity a. Determine the total 
current I if the two capacitor plates are maintained at a potential difference Vo- 
Explain how such a device could be used to measure the salinity of salt water. 

7.11. A solid cylindrical conductor of length L and cross section A has resistivity p\ 
in half its length and p 2 in the other half. There exists a current / in the cylinder 
parallel to the axis and uniformly distributed over the cross section. Find the surface 
charge density a on the interface between p\ and p 2 . Assume the permittivity of 
both materials is 

Sec. 7.4. The Drude Model 


7.12. From Eq. (7.24), which is based on the classical model of conductivity, estimate the 
mean free path k of a conduction electron in copper at room temperature, assuming 
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there is one conduction electron per atom. Do the calculation for both the classical 
thermal velocity and the Fermi velocity. Discuss the resulting values of X. 

7.13. Asa classical model of conductivity, assume an electron in a conductor moves sub¬ 
ject to two forces: —eE and — yv, where v is its velocity. (The damping force is a 
model for the energy loss by collisions.) Determine the conductivity in terms of e, 
y, and the electron density. 

Sec. 7.6. Decay Time 

7.14. Calculate the time constant, in seconds, for decay of a charge density fluctuation in 
copper and in germanium. 

7.15. (a) Derive (7.34) for the charge on a capacitor as a function of time in an RC circuit. 

The time constant of the decay is r = RC. By what factor does Q decrease 
between t = 0 and t = r? Between t = r and t = 2r? Between t = 5r and 
t — 6 t? At what time is the charge one-tenth of its original value? 

(b) Verify that the total energy is conserved as the charge comes to equilibrium. 

General Exercises 

7.16. On atmospheric electricity. At the surface of the Earth (radius 6400 km) there 
is an average electric field E r & — lOOV/m, and a corresponding current density 
J r ~ —3.5 x 10~ 12 A/m 2 that carries electrons away from the Earth to the upper 
atmosphere. The potential difference between the Earth and the upper atmosphere is 
400 kV. What is the total current that reaches the Earth’s surface? What power is re¬ 
quired to maintain this current? How does it compare to the output of a power plant? 
Earth’s surface charge is continually restored by lightning, which carries down nega¬ 
tive charge from the upper atmosphere. A typical lightning stroke has a peak current 
of 10 4 A and carries 20 C. How many lightning flashes occur each second, on aver¬ 
age, worldwide? (See The Feynman Lectures, Vol. II, Chap. 9.) 

7.17. You are given three resistors, of 1 £2, 2 S2, and 3 £2. What are all the resistances you 
can make, using either one, two, or all three resistors? (Hint: There are 17 arrange¬ 
ments.) 

7.18. Consider the resistor network shown in Fig. 7.13. There are n resistors crossing 
from the top line to the bottom line, and n — 1 resistors along the top or bottom line. 
The total number of resistors is 3 n — 2. All the resistors have equal resistance R. 
Let R„ be the resistance of the network. Calculate R\, R 2 , R 3 , and Rcc- [Answer: 
Roc = K(V3 - 1).] 



FIGURE 7.13 A resistor array. The resistance is R„ where n is the number of rungs in 
the ladder. In the illustration, n = 5. 
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FIGURE 7.14 A resistor array. The network continues ad infinitum. 


7.19. Consider the resistor array shown inFig. 7.14. Thetotal resistance between A and B 
is Rq. The ladder of resistors extends to the right ad infinitum. (This model resembles 
the resistance in an axon nerve; R\ represents the resistance of unit length along the 
cell wall, and R2 the resistance of unit length across the membrane.) 

(a) Note that the resistance from the part of the array to the right of the left-most 
rung of the ladder is also Rq. From this show that Rq = R\ 4- RiRqZ/Ri + Po)- 

(b) Determine Rq. 

(c) Now suppose a current flows into A and away from B. Let V n be the poten¬ 
tial difference across the nth rung to the right of A and B. Show that the ratio 
Vn + l/V n hR 2 /(Ro + R 2 )- 

(d) Show that V n decreases exponentially with n. 

7.20. A wire of length L and cross section A lies along the positive x axis. The resistivity 
varies with x, as 

p(x) = p 0 +pie~ x/d . 

(For example, the end at x — 0 may have become oxidized, and therefore have a 
larger resistivity than the pure metal.) The end at x = 0 is held at potential Vq and 
the other end is grounded. Determine the current I and power per unit length d P/dx. 
Make a sketch of dP/dx as a function of x. 

7.21. The resistivity of a wire increases with temperature, according to p = pQe af T ~ T<>1 , 
where Tq is room temperature. (The parameter a is called the temperature coefficient 
of resistivity.) Let C be the heat capacity of the wire, in units of Joules/K. The length 
of the wire is l and its cross sectional area is A. At time t = 0 the wire is connected 
to a battery with constant emf Vq. Assuming all the Joule heat goes into raising the 
temperature, determine T as a function of t. Show that T = Tq + a -1 ln(l + a/it), 
where fl is a constant that depends on the parameters of the problem, and find fi. 

7.22. Show that the time constant for discharge of a leaky parallel plate capacitor, from 
current through the dielectric, is r = €qkp, where k is the dielectric constant and p 
the resistivity. Use the equations for R and C of parallel plates to show that r = RC. 

7.23. The region between concentric conducting spheres is filled with a material with con¬ 
ductivity a and dielectric constant k . The inner sphere (radius a) is connected to the 
positive terminal of a battery with emf Vo, and the outer sphere (radius b) is con¬ 
nected to the negative terminal. At t = 0 the battery is disconnected. 

(a) Calculate Q(t), the charge on the inner conductor, and sketch a plot of this 
function. Show that the decay constant is k€q/ a. 

(b) Calculate 7(f). 
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Computer Exercises 

7.24. The current in a thin film may be described by a surface current density K, with 
units A/m. Consider a sample of the form shown in Fig. 7.15. The film lies on the 
xy plane and has a small thickness <5 in the z direction. Then 


K = J5 = K x \ +K y ], 


The continuity equation for a steady current is V • K = 0. Also, because J = crE, K 
is irrotational, V x K = 0. Together these imply that K = — Vd>, where <t> satisfies 
Laplace’s equation, V 2 <b = 0. 

Use the method of iterative relaxation (Sec. 5.5) to solve Laplace’s equation in 
the thin film with the boundaries shown in Fig. 7.15. There are mixed Dirichlet and 
Neumann boundary conditions: 


Of—1, V) = 1 

for 

— 1 < y < 1 


0(1, y) = 0 

for 

-1 < y < 0; 

(7.45) 

3 0/9 v = 0 

at 

y = 1 for — 1 < x < 0, 


a<t>/a v = o 

at 

y = 0 for 0 < x < 1, 


3 0/3 v = 0 

at 

y = — 1 for — 1 < x < 1, 


dt>/dx = 0 

at 

x = 0 for 0 < y < 1. 

(7.46) 


The conditions (7.45) state that the current enters and leaves the film in the i direc¬ 
tion at the two ends; the conditions (7.46) state that the current is tangential on the 
sides. 

(a) Make a contour plot of (x, y) and on it sketch the current vectors. 

(b) Calculate K at some representative points. For example, where is |K| greatest? 


v 


+ 1 


4 >= 1 -^ 

1 

-1 

0 

+ 1 


-1 



FIGURE 7.15 A current-carrying thin film. A current K = — V<t> enters at at = — 1 

and exits at x = +1. 
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FIGURE 7.16 A driven RC circuit. 


7.25. Figure 7.16 shows an RC circuit driven by a harmonic EMF. The differential equa¬ 
tion for the charge on the capacitor is 

£q sin cot = IR + Q/C, 
where I = dQ/dt. The solution has the form 

Q(t) = Q\e~ t l T + A sin(o)t — (f>), 

where r = RC is the time constant. Q\, A, and <p are constants, which must be 
determined from the differential equation and the initial conditions. The first term 
Q\e~ l l x is the transient. 

Rather than solve the equation analytically, use a computer program to solve 
it numerically. (It is just an ordinary differential equation.) Assume at t — 0 the 
charge on the capacitor is 0. Take RC to be the unit of time, and to be specific let 
co = 4/RC and £$/R = 1. Plot Q as a function of time. Also, make a plot showing 
both Q{t) and sin cot on the same graph, and estimate the phase shift of the charge. 
The calculations can also be done analytically. 




CHAPTER 



Magnetostatics 


‘‘To those who do not know mathematics it is difficult to get across a real 
feeling as to the beauty, the deepest beauty, of nature.... If you want to 
learn about nature, to appreciate nature, it is necessary to understand the 

language that she speaks in.” 
—R. P. Feynman, The Character of Physical Law 


Everyone has played with magnets, and felt the mystery of their forces. What 
causes the magnetic force? In 1820 Oersted discovered that magnetic forces are 
produced by electric currents, by demonstrating that a compass needle is deflected 
if placed near a current-carrying wire. Conversely, a magnet creates a force on 
a current. The effect was further studied by Biot and Savart, and by Ampere. 
Ampere in particular explored in detail the magnetic forces between electric cur¬ 
rents. Later, Faraday and Maxwell developed the field theory of electric and mag¬ 
netic forces, in which currents create magnetic fields. 

The subject of this chapter is magnetostatics— time-independent magnetic 
fields, and their interaction with constant currents. The principles we will study 
are exact for magnetic systems with constant currents, and apply approximately 
to systems in which the currents change slowly in time, called the quasistatic 
approximation. 

Some magnetic phenomena were known long before experiments with cur¬ 
rents and wires. Naturally occuring lodestones, which attract iron, were known 
to the people of Ancient Greece. The magnetic compass was invented in China, 
probably over 2000 years ago, and came to Europe in the 12th century. Gerhard 
Mercator, in 1546, introduced the idea that the point a magnetic needle seeks is 
terrestrial, and not in the stars. In 1600, William Gilbert concluded from his ex¬ 
periments on spherical lodestones that the Earth itself is a great magnet; in his 
words, “magnus magnes ipse est globus terrestris .” In modem terms we’d say he 
explored the external magnetic field of a spherical permanent magnet. (We will 
calculate the same field in Chapter 9.) It is interesting to note that Gilbert’s ex¬ 
periments predate by 90 years Newton’s understanding of the terrestrial origin of 
gravity. 1 

1 As an historical aside, before Gilbert's time people had the idea that the behavior of a magnetic 
compass is governed by the stars, because the north pole of a compass points to the North Star. We 
owe a lot to Mercator and Gilbert for looking for a natural cause in the Earth, nearer at hand than the 
stars. 
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In Chapter 9 we will see how the magnetic field of a permanent ferromagnet 
can be attributed to electric currents on the atomic scale. But first we need to 
comprehend the basic principles of magnetostatics. 


8.1 ■ THE MAGNETIC FORCE AND THE MAGNETIC FIELD 
8.1.1 ■ Force on a Moving Charge 

The most basic form of the magnetic force is the force on a charge q moving with 
velocity v, exerted by a magnetic field B, which is 

F = q\xB. (8.1) 

The SI unit of magnetic field i s the tesla (T). One tesla i s defined as one newton 
per ampere-meter. By (8.1) the force on 1 C moving 1 m/s perpendicular to a field 
of 1 T is 1 N. A magnetic field of 1 T is a rather strong field. The largest fields 
that can be produced by conventional electromagnets are about 2 T. For high-field 
research in laboratories, fields of 10-12T, produced with superconducting mag¬ 
nets, are used. The record for steady-state fields is about 30 T. At the surface of the 
Earth the magnetic field is about 0.5x10 -4 T. Very large and very small fields are 
important in astronomy; e.g., 10 8 T near pulsars, and 10“ 10 T in interstellar space 
in the galaxy. 

The direction of the magnetic force on q is sideways, i.e., perpendicular to v 
and to B, as shown in Fig. 8.1 for a positive charge. Therefore, the magnetic force 
does no work on q: 


dW = F-dx=F \dt = 0. (8.2) 

The magnetic force affects the direction of motion of q, but not its kinetic en¬ 
ergy. Because the magnetic force does no work, it is not possible to define a po¬ 
tential energy function for the magnetic force. The magnetic force is velocity- 
dependent, which is quite different from the other fundamental forces we en¬ 
counter in physics. 


B 



FIGURE 8.1 The magnetic force. The force on a moving charge is sideways, perpen¬ 
dicular to both v and B. The figure shows F and the circular trajectory for a positive charge 
in a uniform field. What would the trajectory be for a negative charge —q projected with 
the same initial velocity v? 
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We can demonstrate the magnetic force on a moving charge with a Crookes 
tube and a bar magnet. A Crookes tube 2 is essentially a glass cylinder filled with 
gas at low pressure, with a sealed-in electrode at each of its closed ends. A high 
voltage is applied between the electrodes, creating a beam of electrons traveling 
from the cathode to the anode. The path of the electrons is visible because gas 
atoms along the path are excited by the energetic electrons and emit light. If there 
is no magnetic field the beam is a straight line. But if a magnet is brought near 
then the beam is curved because of the sideways magnetic force (8.1) on each 
electron. 

Equation (8.1) may be taken as the definition of the magnetic field. From the 
measurement of the force on a test charge q, e.g., by observing its deflection for a 
known velocity, the defined quantity B could in principle be deduced from (8.1). 

In using (8.1) we picture a laboratory setup in which there is a B field, e.g., 
due to a large magnet, and v is the velocity of the charge q in the laboratory— 
physicists say “in the reference frame of the laboratory.” The particle is deflected 
by the force calculated by (8.1). But with this velocity-dependent force, it’s natural 
to ask questions like these: What is the force on q in a reference frame that is 
moving in the laboratory? In that frame v will be different, but what will B be? 
Does (8.1) hold in the moving frame? What is the force on the particle in the 
moving frame? In particular, suppose the reference frame is moving along with 
the particle at some instant; in that reference frame the particle has zero velocity. 
Is the force on the particle then 0 as (8.1) implies? Surely not. Is the particle 
undeflected in that reference frame? Surely not. 

The full resolution to these difficult questions will come in Chapter 12. The 
basic point is that (8.1) is only part of a more general force equation, called the 
Lorentz force 


F = q (E + v x B) . (8.3) 

In the reference frame in which v = 0, the particle still experiences a force and is 
deflected. The force on it is, however, not magnetic, but rather electric, given by 
qE, even though there is no electric field in the laboratory! The idea, which we 
will explore fully in Chapter 12, is that the fields E and B are inextricably inter¬ 
twined in the sense that they transform together if the reference frame changes. 
For this reason we properly speak of an electromagnetic field, and the subject is 
called electromagnetism. 

In principle, B can be measured by observing moving test charges, but in 
practice, B is measured with solid-state magnetometers, of which there are many 
kinds. Some examples are Hall-effect probes, 3 and magnetoresistors, which are 
materials whose resistance varies with the applied magnetic field. For measure¬ 
ments of very small magnetic fields, as in magnetoencephalography where the 
neuromagnetic fields are of order 10 _12 T, or in magnetocardiography where 

2 Named for William Crookes, one of the discoverers of cathode rays. 

3 See Exercise 1. 
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the field is of order 2 x 10 11 T, superconducting quantum interference devices 
(SQUIDs) are used. 


8.1.2 ■ Force on a Current-Carrying Wire 

A current-carrying wire in a magnetic field experiences a force, from the magnetic 
force on the individual moving charges that make the current. Electric current is 
motion of charge, and (8.1) is the force on a moving charge. Suppose the cur¬ 
rent consists of particles with charge q and linear density n\, moving with mean 
velocity v. Then the net force dF on a small segment d£ of the wire is 

dF = ( n\dl)q\ x B (8.4) 

because n\dl is the number of moving charges in d£. The current I in the wire is 
qti\v, and v is parallel to d£, so the total force on the wire is 


F = 


Id£ x B. 


(8.5) 


Biot and Savart, and Ampere, measured magnetic forces on current-carrying wires 
in their studies of the basic nature of magnetism. 


8.2 ■ APPLICATIONS OF THE MAGNETIC FORCE 

Many devices in science and technology are based on magnetic deflection of 
charged particles. In this section we will study the equations of motion, and de¬ 
scribe some applications. 

8.2.1 ■ Helical or Circular Motion of q in Uniform B 

What is the general motion of a charge q in a uniform magnetic field? We choose 
the z axis to be the direction of B, so B = fik. The equation of motion of the 
charge isdp/dt = qx x B, where p is its momentum. In general F is perpendicular 
to B, so in this case F z = 0. Therefore the z component of the motion has constant 
momentum, and v z = constant. In the special case v z = 0, the motion of the 
charge is parallel to the xy plane. If v z is not zero, then superimposed on the xy 
motion there is a constant drift parallel to the z axis. To obtain the xy motion 
we note that the kinetic energy and the magnitude |p| of momentum are constant 
because B does no work. But a constant force perpendicular to the motion, and 
constant speed, are characteristics of uniform circular motion. Thus the general 
trajectory is a helix, around k. 

If v z = 0 then the orbit is a circle, as illustrated in Fig. 8.1. For nonrelativistic 
mechanics, i.e., for v much less than the speed of light c, the equation for circular 
motion with radius R is 
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2 

m v 
~R~ 


= —F r = qvB. 


( 8 . 6 ) 


The magnetic force is centripetal. The magnitude is qvB because v is perpendic¬ 
ular to B, so that |vxB| = vB. The angular speed co of the particle is 


ca 


v 

~R 


qB_ 

m 


(8.7) 


This parameter co is called the cyclotron frequency, the period of revolution of q 
is 2n/to. For nonrelativistic motion co is independent of both the orbit radius and 
the speed. 4 

The fact that a charged particle moves on a circle in a uniform field B (for v 
perpendicular to B) is the basis for several important devices, of which we will 
describe two. 


Cyclotron 

A cyclotron is a charged-particle accelerator used for scattering experiments in 
nuclear physics, and for production of radioactive isotopes for medical applica¬ 
tions. In the classic design, shown schematically in Fig. 8.2, particles in a “bunch” 
move on semicircular orbits in the two D-shaped regions, which are called the 
“dees.” In each dee there is a uniform B perpendicular to the orbit to make the par¬ 
ticle trajectory circular. The magnetic force does no work, so to raise the kinetic 
energy of the circling particles an electric field is necessary, which resides in the 
gap between the dees. The direction of E reverses for each half cycle, so that the 
particles in a bunch are accelerated to higher energy each time the bunch passes 
through the gap. The frequency of oscillation of E in Hz is co/fln), where co is the 
cyclotron frequency. The cyclotron frequency remains constant (= qB/m) as the 
particles gain energy, as long as v <£ c. Because co is independent of radius and 
energy, multi-bunch acceleration of charges is possible in a cyclotron, by injecting 
particles at appropriate times in the cycle. 

For higher energies, i.e., v a significant fraction of c, relativistic effects become 
important. In this case the cyclotron frequency does vary with v, so the frequency 
of E must change to remain synchronized with the particle motion. This is the 
principle of the synchrotron. In a synchrotron only one bunch of charges at a time 
can be accelerated in the machine. 

By (8.6) the orbit radius increases as sjE k , where E% is the particle kinetic 
energy, so the bunch spirals out as the particles gain energy. When the particle en¬ 
ergy reaches the desired energy, the beam is extracted by a deflecting magnet, and 
directed to a target where the scattering experiment occurs. Protons, for example, 
may be accelerated to energies of order 100 MeV. 

A different design is necessary for very high energies. In a high-energy syn¬ 
chrotron the orbit radius is fixed, and the strength of B increases as the particle 

4 By (8.7) the momentum magnitude is p = mv = qBR. The solution for r elativistic d ynamics is also 
p = qBR, but p is the magnitude of the relativistic momentum p = myfq 1 — v-/c-. 
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FIGURE 8.2 Cyclotron. There is a magnetic field perpendicular to the plane of the 
dees, and an alternating electric field across the gap. The gap between the dees is greatly 
exaggerated for illustration. 


energy increases. The accelerator ring has alternating regions of B and E, the first 
to keep the particles in a circle and the second to raise their energy. The Tevatron 
accelerator at Fennilab is a synchrotron, 2 km in diameter, that accelerates protons 
and antiprotons to 1 TeV. 

Mass Spectrometer 

A mass spectrometer, shown schematically in Fig. 8.3, is a device invented by 
J. J. Thomson to separate charged particles by mass. The design in Fig. 8.3 has 
three parts, velocity selector, magnetic deflector, and particle detector. A beam 
composed of several species of ion passes into the velocity selector. In the velocity 
selector there are crossed E and B fields, E = Ej and B = Bk. If a charge has 
velocity v = (E/ B) i then it passes through the fields undeflected, 5 because the 

5 See Exercise 2. 
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FIGURE 8.3 Mass spectrometer. Positive ions moving with a definite velocity separate 
in the magnetic field according to the ratio q/m. Only those particles with v = (E/B) i 
(the x axis is to the right) make it through the velocity selector and into the magnetic 
deflector. 


net force (electric plus magnetic) is 0. Other particles will be deflected out of 
the beam. The particles that emerge from the velocity selector have this known 
velocity. 

Particles from the velocity selector then enter a region of uniform magnetic 
field, where they travel on circular paths. By (8.6) the orbit radius is proportional 
to the ratio m/q, so the different particle species travel along circular arcs with 
different radii, and separate in space, as shown in Fig. 8.3. By observing the final 
particle positions with a charged-particle detector, the experimenter can deduce 
what species of particles were present in the initial beam. Mass spectrometers are 
used to identify the products of chemical reactions, or of particles produced in 
nuclear collisions. 

The mass-spectrometer principle is used also in residual gas analysers, labo¬ 
ratory devices that are commonly employed to identify, and locate the source of, 
gas-borne impurities in high-vacuum systems. An interesting aspect of a residual 
gas analyzer is that molecules that have the same mass, and therefore the same 
ratio of m/q when singly ionized, such as CO + and N^~, are difficult to tell apart 
because their orbits are the same. 

8.2.2 ■ Cycloidal Motion of q in Crossed E and B 

As another example of the magnetic force, we will determine the motion of a pos¬ 
itive charge q that is released from rest at the origin in the orthogonal uniform 
fields E = E j and B = B k, as shown in Fig. 8.4. The motion is quite different 
from the previous case because of the electric field. When released, q initially ex¬ 
periences an electric force qE j and accelerates in the +y direction. The magnetic 
force for v in the y direction is in the x direction, so q begins to curve away from 
the y axis toward positive x. With a component of velocity in the +x direction 
there will be a component of acceleration in the — y direction. Now, what is the 
full trajectory? 
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y 



from the origin. E and B point in the y and z directions, respectively. The trajectory is a 
cycloid curve. 


The equation of motion that we must solve is 

d\ 

m — = rjr(E + vxB); 
dt 

or, in component form, where w is the cyclotron frequency, 


( 8 . 8 ) 


dv x /dt = a>Vy 

(8.9) 

, . , coE 

dvy/dt = -cov x + —- 
B 

(8.10) 

dv z /dt = 0. 

(8.11) 


We see that v z remains constant, equal to the initial value 0. Differentiating (8.9) 
and substituting for dv y /dt from (8.10), 


d 2 v x 
dt 2 


+ w 2 v x 


w 2 E 


( 8 . 12 ) 


The solution of this inhomogeneous linear equation is constructed as the sum of 
a particular solution ( v xp = E/B) plus a general solution of the homogeneous 
equation. That is, 


Jdj 

v x (t) =-h c i cos cot + C 2 sin cot , 


(8.13) 


where c\ and C 2 are constants to be determined from the initial conditions. Then 
by (8.9) 


v y (t) = -ci sin wf + C 2 cos cot. (8.14) 

The initial values u^(0) = u_ v (0) = 0 determine that the constants c i andc 2 are 

ci = —E/B and C 2 = 0. (8.15) 

It is easy to integrate v x (t) and v y (t) with respect to t to find the trajectory of q 
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in space; the result is 


x{t) 

y(t) 


- (cot — sinwf), 

coB 


-(1 — cos cot). 

coB 


(8.16) 

(8.17) 


These are parametric equations for a cycloid curve, shown in Fig. 8.4. 

The motion of q is a combination of harmonic oscillations in x and y, and a 
constant drift in the x direction. The drift velocity, i.e., the average velocity over 
one or more periods of oscillation, is 



(8.18) 


because the averages of the oscillating terms are 0. Equation (8.18) is equivalent to 


v D 


E x B 


(8.19) 


The charge drifts in a direction perpendicular to E. 

The result (8.19) has a surprising consequence in plasma physics. The drift 
velocity is the same for positive and negative charges. This fact is obvious because 

(8.19) does not depend on q. (The reader should analyze the forces qualitatively 
and figure out why vq has the same direction for + or — charges!) Therefore 
a constant electric field, in the presence of an orthogonal magnetic field, does 
not produce a net electric current in a dilute (i.e., collisionless) neutral plasma, 
because the positive and negative charges drift together. 

Some interesting variations of the cycloidal motion are explored in a computer 
exercise. 6 


8.2.3 ■ Electric Motors 

An everyday application of the magnetic force on a current is in electric motors. 
The basic design principle of any electric motor is that an electric current in a 
magnetic field experiences a force perpendicular to the current. In a simple DC 
motor the current is driven by a constant EMF, e.g., a battery. The current exists 
in a coil of wire, which is free to rotate on an axle attached to the coil. The motor 
has a magnet—either a permanent ferromagnet or an electromagnet—that exerts 
a magnetic force on the current. The force acts in opposite directions on opposite 
sides of the coil, creating a torque on the coil, as shown schematically in Fig. 8.5. 
To keep the direction of the torque constant as the coil rotates, the current in the 
coil must reverse every half cycle. The switch that reverses the current direction 
is called a commutator. In a motor with only two poles, as in Fig. 8.5, the com¬ 
mutator is a split ring consisting of two semicircular conductors insulated from 


6 See Exercise 36. 
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FIGURE 8.5 Electric motor. Because of the magnetic force on the current around the 
coil, there is a torque N on the coil. In the figure the forces on the left and right vertical 
segments point out of and into the page, respectively. 


one another. In a multipole motor the commutator ring has several conducting 
segments. If a machine is connected to the axle of the motor, then the torque on 
the coil drives the machine. 

Another type of electric motor, that uses an alternating source of EMF, is an 
induction motor. An induction motor uses a time-dependent field B(f) to create 
current in the rotor coil by electromagnetic induction (Chapter 10). Other applied 
fields exert forces on the current to drive the motor. 

Historically, the first demonstration model of an electric motor, a device using 
a magnet to produce rotation of a current-carrying wire, was built by Faraday. 
The model, illustrated in Fig. 8.6, was not a practical design, but it illustrated the 
principle. 



FIGURE 8.6 Faraday’s motor. The conducting bar is pivoted at the top at a fixed angle 
to the vertical, and its lower end is free to move through the liquid mercury. The wires 
connected to the battery are held fixed, and there is a field B due to a magnet below the 
pivot. In what direction will the bar spin? 
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8.3 ■ ELECTRIC CURRENT AS A SOURCE OF MAGNETIC FIELD 

Where does a magnetic field come from? What is its source ? The most familiar 
source is a permanent magnet, i.e., a piece of magnetized iron or other ferromag¬ 
netic material. But at a more basic level the source of B is to be found in electric 
current. In Chapter 9 we will study magnetic materials, and see that the magnetic 
field of a ferromagnet comes from properties of atomic electrons—their spin and 
orbital motion—which, although not classical currents, do involve dynamics of 
charged particles. However, in this chapter we are concerned with B from macro¬ 
scopic steady currents. 7 

8.3.1 ■ The Biot-Savart Law 

Oersted discovered that an electric current creates a force on a magnetized com¬ 
pass needle. In the language of field theory, electric current is a source of magnetic 
field. Biot and Savart 8 measured the forces between currents and magnets. 

The experiments on which the Biot-Savart law is based were done with currents 
in macroscopic wires in circuits. But to start, we consider a more elementary 
formula: The field dB at a point P, due to an infinitesimal current element Idt at 
a point P'. Figure 8.7 shows the geometry. For this elemental case the Biot-Savart 
law is 


jtio Idt x ? 
4 n r 2 


( 8 . 20 ) 


Here r is the distance between P' (the source point) and P (the field point), ? is 
the unit vector in the direction from P' to P, and r = r r. Note that the Biot-Savart 
law is an inverse-square law, like Coulomb’s law, but the direction of the magnetic 
field is azimuthal, around the axis of Idt. 



FIGURE 8.7 Elemental form of the Biot-Savart law. Idt is the source of magnetic 
field, and d B is the resulting field at P. 

7 In Chapter 11 we’ll study another source of B—the Maxwell displacement current. 

8 J. B. Biot (b. 1774, d. 1862 Paris), F. Savart (b. 1791 Mezieres, d. 1841 Paris), French physicists. 
Besides their work on magnetism, Biot had a particular interest in polarized light and Savart had a 
particular interest in the physics of violins. 
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z axis 



FIGURE 8.8 A long straight current. A source point P', at which the current element 
Id£ is located, is on the z axis at (0, 0, z). The vectorfrom the origin to P' is x r . The field 
point P, at which B(x) is calculated in Example 1, is at perpendicular distance R from the 
z axis. The cylindrical coordinates of P are (R, (f>, z). The vector from the origin to P is x 
and the vector from P' to P is r = x — x'. The inset shows the relationships among R and 
k, which are the unit vectors of cylindrical coordinates, and f = r/r. 


To get the field due to a macroscopic wire we must integrate (8.20). Then, 
letting x denote the position of P, the magnetic field B(x) is 

f Idix. r _ mo f Idl x (x - x' ) 

X 4 7Z-i w j re r 2 47 r i wire |x-x'| 3 

Here r = x — x' is the vector from the source point x', i.e., the position of the 
wire element d£, to the field point x, as illustrated for the special case of a straight 
wire in Fig. 8.8. Again, r = |r| and r = r/r. We take (8.21) as an empirical 
law, deduced from experiments on the forces between currents and magnets. The 
parameter /iq, called the permeability of the vacuum, depends on the system of 
units. In SI units it is assigned the value = 47T x 10~ 7 Tm/A, exactly; that is, 
the unit of current (ampere) is defined such that /iq has this value. 

Examples 1 and 2 below show that calculating B from the Biot-Savart integral 
is not simple, because of the vector character of the integral. We will later develop 
simpler methods for calculating B, especially by Ampere’s Law. Nevertheless, it 
is important to see how (8.21) works, by studying these examples. 
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EXAMPLE 1 Determine the magnetic field due to a current / in a long straight 
wire. 

Figure 8.8 shows the geometry. The direction of / is k. The field point x has 
cylindrical coordinates ( R , <p, z). In (8.21) the length element d£ is k dz' at the 
source point x ' = k z!, and 

r = |x - x'| = y/\ z - z >)2 + R 2 (8.22) 


SO 


B(x) 


mo/ 

4tz 


i : 


k x ? dz' 

(z - z') 2 + R 2 


(8.23) 


Note that we have approximated the wire by an infinite line—an idealization but a 
valid approximation for points x near the wire and far from the ends. We assume 
the return path for I is so far away that it makes a negligible contribution to B 
at x. In this approximation B is independent of z so without loss of generality we 
can let z = 0. 

The unit vector ? points in the direction from the location of the current element 
Idi to x. Examining Fig. 8.8 we see that 


r = Rcos \jz + ksin^, (8.24) 

where R is the cylindrical radial unit vector at x and iz is the angle between R 
and r. The ability to expand one unit vector as a linear combination of other unit 
vectors is a useful skill in vector analysis. First note that ? is in the plane spanned 
by R and k, and write r = c\ R + C 2 k. Then determine the coefficients by dotting 
? with R and k: C| = ? • R = cos \jz and C 2 = ? • k = sin i/r. (See the inset in Fig. 
8.8.) Hence (8.24). 

Now, in (8.23) k x f = ^>cos \jz and cos i fr = R/r. Making these substitutions 
we have 


B(x) = ^-$R j ’ 


dz' 


4?r -- J_ oo (z /2 + R 2 )^ 2 ' 

The evaluation of the z' integral is an exercise in calculus, 

dz' z! 


£ 


/-oo 0 z a + R 2 ) 3/2 R 2 (z' 2 + R 2 ) 1 ! 2 

Thus the magnetic field of a long straight wire is 

Mo/ z 


1_ 

R 2 ' 


B(x) = 


2t tR 


<t>- 


(8.25) 


(8.26) 
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The direction of B is azimuthal. That is, B “curls around the current,” which is 
a useful mnemonic for the direction of B. With the thumb of your right hand 
pointing in the direction of /k, the fingers curl in the direction of B. 


EXAMPLE 2 Determine the magnetic field due to a circular current loop, at an 
arbitrary point on the axis of symmetry. 

Let the z axis be the axis of symmetry, with the current loop in the xy plane, 
as shown in Fig. 8.9. The loop has radius a and current I. The line element at 
angular position cp on the loop is di = <pad<p, so by (8.21) 


B(z) 


IM)I 

An 


,2.t 

Jo 


cup x f dcp 
a 2 + z 2 


(8.27) 


The field point P is on the z axis at (0, 0, z)- The unit vector r points in the 
direction from di to the field point zk. The inset in Fig. 8.9 shows that 

? = — Rcost/r + ksint/r (8.28) 

where R is the cylindrical radial vector and p is the angle between ? and —R. 
Therefore in (8.27) <fi x ? is a sum of two terms, 


z axis 



FIGURE 8.9 A circular current. The field at the point x = zk is calculated in Example 
2 from the Biot-Savart law. The element di of the wire is located at the point x = aR. 
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4> x ? = kcos \f; + Rsini/r. (8.29) 

In (8.27) x r is integrated from 0 to 2n in (p. The second term in (8.29) integrates 
to 0; this can be seen by symmetry, or more directly by noting that R = i cos (p + 
j sin <p and both cos (p and sin (p have integral 0. Therefore the field at zk is in 
the k direction. Note, too, from Fig. 8.9 that cos \fr = a/r. The evaluation of the 
remaining integral is trivial because the integrand is independent of <p, and the 
result is 


B(z) = 


/x 0 Ia- 


2 ( a 2 + z 2 ) 


3/2 


(8.30) 


Asymptotically, as z —*■ oo, the magnetic field decreases as 1/z 3 - 

On the z axis the field points in the k direction. This is again consistent with 
the mnemonic that B curls around the current. Anywhere on the loop, if the thumb 
of your right hand is tangent to the circle then the fingers curl up through the loop 
in the k direction. (Conversely, if the fingers curl with the current then the thumb 
points along B.) 


8.3.2 ■ Forces on Parallel Wires 

Consider two long parallel wires, with currents I\ and /t, having length l and 
separation d. The situation is shown in Fig. 8.10. The force on wire 2 is 

F 2 = J I 2 di 2 x B, (8.31) 

where Bi is the field due to wire 1 at the position of dt 2 - The field Bi may be 
calculated from (8.26), neglecting end effects, i.e., treating Bi as constant along 


wirel 


wire2 


h 


h 




T B, 


FIGURE 8.10 The forces on parallel wires. 
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wire 2, which gives 


8, = (8.32) 

this is a valid approximation for Bi if i d. The vector di 2 x 4>\ is r 2 idf 2 , 
where r 2 i lies in the plane of the wires and points from wire 2 toward wire 1. 
Therefore the force on wire 2 is 


F 2 = 


mhhi „ 

-r?i. 

2nd 


(8.33) 


The force on wire 1 is equal but opposite to F 2 . 

We see from (8.33) that if the currents I\ and / 2 are in the same direction then 
the force is attractive; or, if they are in opposite directions the force is repulsive. 
For static charges, opposites attract; but for currents, opposites repel. The reader 
should study Fig. 8.10, figure out the current, field and force vectors, and verify 
that the direction of the forces is as described. 

Ampere studied the force between currents, and determined by experiment the 
result (8.33). However, Ampere’s work predated the development of field theory. 
Ampere attributed the force to “action at a distance” between currents. Field the¬ 
ory is different: Wire 1 produces a field Bi throughout the space, and that field at 
wire 2 exerts the force F 2 . 

The SI unit of current, which is the basic SI electrical unit, is the ampere (A), 
or “amp” for short. The ampere is defined by the statement that the force per unit 
length on long parallel wires is exactly 2 x 10 _7 N/m if I\ — / 2 = 1 A and 
d = 1 m. It follows that no has exactly the value 4n x 10“ 7 Tm/A. 


8.3.3 ■ General Field Equations for B(x) 

Equation (8.21) gives the field produced by a one-dimensional current, e.g., I in 
a thin wire. We must generalize the equation to the case of an arbitrary volume 
current density J(x). The generalization follows from the superposition principle. 
Recall that the dimensions of J are A/m 2 , so J • dA is the current through a small 
area element dA. Consider an infinitesmal volume d 3 x' at x', consisting of area 
dA perpendicular to J(x') times length di parallel to J(x'). The current through 
dA is J • dA = J dA, and the contribution to B(x) from this small current element 
is, according to the Biot-Savart law, 


dB(\) = 


/U-o JdA di xr 
An r 2 


(8.34) 


where again r = x — x'. Now, dA di is just the volume d 3 x', and di is parallel to 
J(x'). Thus the complete field at x is 


B(x) = 


d 0 f J(x') x (x-xQ 
An J |x -x'| 3 


(8.35) 



268 


Chapter 8 Magnetostatics 


(Note that r = |x — x'| and r = (x — x')/|x — x'|.) This is the generalization of 
the Biot-Savart law (8.21) to an arbitary current distribution throughout a volume 
of space. 

The general equation (8.35), obtained by replacing the infinitesimal linear cur¬ 
rent element Idi by the infinitesimal volume current element J (x')d^x', can also 
be extended to surface currents. Thus if K is a current per unit length flowing 
on a surface S' then the “current times length” dl dt is K (x')dA'. We picture K 
as the current crossing unit length perpendicular to the flow direction. As a rule 
the substitutions Idi —*■ K{x')dA' -*■ J (x')d 3 x' apply in the Biot-Savart law 
(8.21), and in the magnetic force law (8.5). These substitutions are analogous to 
the electrostatics cases (Chapter 3) where we have dq —*■ cr(x')dA' — > p(x')d^x' 
connecting, respectively, infinitesimal charges, surface charge densities, and vol¬ 
ume charge densities. 

Equation (8.35) is one of the basic equations of magnetism. It determines the 
magnetic field created by any steady current source. Equation (8.35) is analogous 
to the Coulomb integral (3-9) for a static electric field. In electrostatics the charge 
density p(x') creates the electric field E(x); in magnetostatics the current density 
J(x') creates the magnetic field B(x). 

From (8.35) we will derive two partial differential equations satisfied by the 
field B(x). We will eventually prove that 

V • B = 0, (8.36) 

and 

V x B = p-oJ- (8.37) 

Equation (8.36) is universal, i.e., always true, assuming that magnetic monopoles 
do not exist. 9 Equation (8.37), called Ampere’s Law, is only valid for magneto¬ 
statics, i.e., for the field from a current distribution that is constant in time. 

The last point in the previous paragraph will be very important when we con¬ 
sider time-dependent fields, so it deserves some more explanation here, although 
that will take us away from magnetostatics briefly. Recall from Chapter 2 that 
the divergence of the curl of any vector function is identically 0. In particular, 
V • (V x B) = 0. Therefore (8.37) implies 

V • J = 0. (8.38) 

This equation is satisfied in magnetostatics. Recall the continuity equation, V J = 
—dp/dt, which was discussed in Chapter 7 and is always.true. For a static system 
dp/dt is 0, so then V • J = 0. However, for time-dependent systems (8.38) is not 
true, and (8.37) must be changed by including the displacement current. That is. 


9 See Sec. 8.5. 
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generally J in (8.37) must be replaced by J + eo3E/3 1 , so that 
V x B = mo (J + eo3E/3r), 

one of the Maxwell equations. The divergence of the right-hand side is always 0: 

/ 3E\ dp 

V.(j + eo -) = V.J + ^=°, 

because eoV • E is equal to p by Gauss’s Law. We shall study the time-dependent 
Maxwell equation in Chapter 11, but we restrict ourselves in the present chapter 
to static systems, with (8.37). 


Returning to magnetostatics, we must prove (8.36) and (8.37). The starting 
point is the integral representation of B in (8.35). Now, note that 


(x-xQ _ 1 

Ix-x'l 3 Ix-x'l' 


(8.39) 


Make this replacement in (8.35), and reverse the order of terms in the cross prod¬ 
uct, remembering to change the sign. Next, rewrite the integrand using the vector 
identity 



x J(x') = V x 



(8.40) 


Finally, pull the curl operator out of the integral in (8.35), which is allowed be¬ 
cause Vx is the curl with respect to x, whereas the integral is over x'. (It may help 
to write the curl in Cartesian coordinates to see that this step is correct.) Hence 


B(x) = V x 


MO 

f J(xV 3 x'" 

4 7t J 

Ix-x'l _ 


(8.41) 


Equation (8.41) is important. It says that B(x) is equal to the curl of a vector 
function, B = V x A. The function A(x) is called the vector potential and will be 
discussed further in Sec. 8.5. The first field equation (8.36) follows immediately 
from (8.41), because the divergence of the curl of a vector function is identically 0. 

Equation (8.41) is a special case of the Helmholtz theorem, which is discussed 
and proven in Appendix B. Any vector function that approaches 0 at oo can be 
written as the sum of a curl and a gradient, as in equations (B.3) to (B.5). Applying 
the theorem to B, the gradient term (B.4) is 0 because V ■ B = 0. The curl term 
is V x A. Furthermore, comparing (8.41) and (B.5) we see that V x B = MoJ- 
This proves, by a rather roundabout argument, the second field equation (8.37). 
Because Ampere’s law (8.37) is very important, a more direct proof will be given 
in Sec. 8.4. 
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8.4 ■ AMPERE'S LAW 

Equation (8.37) is the differential form of Ampere’s law: V x B = /xqJ. We may 
write an equivalent integral form. Let C be an arbitrary closed curve (loop) in 
space, and consider the line integral of B around C. Also, let S be any surface 
bounded by C, as illustrated in Fig. 8.11. By Stokes’s theorem, the loop integral 
of B is equal to the surface integral of V x B on S. That is, using (8.37), 

<j> B • di = /Uq J J • dA = p,qI (enclosed). (8.42) 

This is the integral form of Ampere's law, also called the circuital law. / is the 
current flowing through S, or equivalently enclosed by C. The flux of J is inde¬ 
pendent of S, for a given boundary C, because V • J = 0. Recall from Stokes’s 
theorem that the directions of di and dA are correlated by the right-hand rule: 
With the thumb pointing in the direction of dA, the fingers curl in the direction 
of di. 

We derived the integral (8.42) from the differential equation (8.37). We can 
reverse the derivation, because the integral mustholdfor any closed curve C. The 
differential and integral relations are equivalent. 

Although we are postponing the proof of Ampere’s law until the end of this 
section, we can easily check that the integral relation (8.42) is true for some simple 
cases. Let I be the current in a long straight wire, for which the magnetic field is 
(8.26). First, suppose C is a circle perpendicular to I with I at the center, shown 
in Fig. 8.12(a). Then 


j) B • di = (mo/ /2nr)(2nr) = ixqI, 

consistent with (8.42). Second, suppose C is the boundary of an angular section 
cut out of a circular annulus centered at I, shown in Fig. 8.12(b). Along the radial 
sides of C, B • di = 0; along the azimuthal arcs, 



FIGURE 8.11 Geometry for Ampere’s Law. 
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FIGURE 8.12 Two examples of Ampere’s Law. (a) A circle enclosing a wire; (b) an 
annular section not enclosing the wire. 



B • dt = ±(jU,o//27rr)(r A0) = ±fM 0 I (Acp/27T), 


where Acp is the angle of the section. The arcs cancel in the loop integral, so 
<f B • dt = 0, consistent with (8.42) because there is no current enclosed by C. 


8.4.1 ■ Ampere Law Calculations 

For current distributions with a high degree of symmetry it is often possible to 
derive the magnetic field by direct application of the circuital law, taking into ac¬ 
count the symmetry. The next four examples illustrate the method. When studying 
these examples, the reader should draw a picture showing the current, field, and 
Amperian loop. 


EXAMPLE 3 What is the field due to a current I in a long straight wire? We 
previously calculated B(x) from the Biot-Savart law. Here we will determine the 
field from (8.42). 

Approximate the wire as an infinite line. By the cylindrical symmetry of the 
problem, and the fact that B “curls around” the current, B must be azimuthally 
directed around the wire. The magnitude of B cannot depend on either the po¬ 
sition z along the wire or the angle (p around the wire. Therefore in cylindrical 
coordinates, B(x) must have the form 

B(x) = B(r)4>. (8.43) 


Now, apply (8.42) to a circular loop centered at the wire with radius r. The loop 
integral is ^B ■ dt — 2nrB(r), and this must equal Bol by (8.42), so 


B(r) = 


Bo I 
2 nr 


(8.44) 


The result is, of course, the same as (8.26), but the derivation here is much easier. 
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Amperian 



FIGURE 8.13 Example 4. The magnetic field produced by a current sheet can be deter¬ 
mined from Ampere’s Law. 


EXAMPLE 4 What is the magnetic field due to a planar current sheet? 

Referring to Fig. 8.13, let K = K i be the suiface current density (current per 
unit of length perpendicular to the current) on the xy plane. That is, the current is 
in the x direction, and Kdy is the charge per unit time passing across the perpen¬ 
dicular length dy. By symmetry, and the fact that B curls around the current, B(x) 
must have the form 


-fl(lzl) j for z > 0 
+£(|z|) j for z < 0, 


(8.45) 


as shown in Fig. 8.13. In this equation we have allowed the field to have z depen¬ 
dence, although we shall soon see that it is independent of z. Now, apply (8.42) 
to a rectangular loop of size L x 2z parallel to the yz plane and bisected by the 
xy plane, also shown in Fig. 8.13. Let the top side of the loop be at +z and the 
bottom side at — z. The loop integral is j> B • di = 2BL, and the current enclosed 
by the loop is KL; so by (8.42) 


B(z) = noK/2. (8.46) 

Note that the field is independent of the distance z from the plane, because we 
have considered an infinite plane of current. Recall that E due to an infinite 
charged plane is similarly independent of the distance from the plane. We say 
in problems involving only infinite planes “there is no scale of length.” Equation 
(8.46) is a valid approximation for the magnetic field near a current sheet far from 
the edges. 


An extension of Example 4 is to find B everywhere in space for a configuration 
with surface current K = +K i on the plane z = +a, and K = — K i on the plane 
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z = —a. This configuration, called a bus bar, with two parallel conducting plates 
carrying large currents in opposite directions, is used in power stations. By the 
superposition principle, i.e., adding the fields of the two plates, the magnetic field 
is n 0 Kj between the plates, and 0 above or below the plates. In practice, the 
two parallel conductors of a bus bar are parallel conducting ribbons, not infinite 
planes. But because the separation between the ribbons is small compared to their 
width, the simplifying assumptions we’ve used hold to a good approximation. 


EXAMPLE 5 As an extension of Example 4, we now find the magnetic field due 
to the constant volume current density J = Jo i flowing in the conducting slab 
shown in Fig. 8.14. The slab extends between —a<z<a and is infinite in the x 
and y directions. 


z 




(b) 


FIGURE 8.14 A slab of current, (a) The slab geometry. The current density in the slab 
is uniform, J = Jq i . The magnetic field due to this current is in the ±j direction, (b) The 
magnetic field as a function of z. 
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This problem has enough symmetry that it can be solved by Ampere’s law, but 
first the symmetry must be exploited to gain insights as to what form B may have. 

Notice first that B cannot depend on the coordinates x and y because there is 
no variation in the geometry as these coordinates are changed; thus, B = B(z)- 
Notice next that the finite slab can be considered as a stack of infinitesimal slabs, 
or current sheets, of thickness dz, each carrying the current per unit length dK i = 
Jod z i. The idea is that each section of slab, with unit length in the y direction and 
dz thick, carries current Jodz. in the x direction. But, as we know from Example 4, 
the field due to such a current sheet is constant in magnitude and parallel to the y 
axis, in the — y direction above the sheet, and in the +y direction below the sheet. 
The field due to the stack of such sheets must also be parallel to the y axis, so we 
may write B = B y (z) j. Notice finally that B = 0 on the plane z = 0 because 
the field due to the current sheet at +z cancels the field due to the symmetrically 
opposite one at — z. 

We will apply Ampere’s law (8.42) to the two rectangular loops shown in Fig. 
8.14. The direction of traversal of all the loops is counterclockwise, consistent 
with the right-hand rule and current out of the page. 

First apply (8.42) to the loop afiySa, which has dimensions l x z with z > a. 
The only nonzero contribution to ^B • dl comes from the segment yS, because 
B = 0 along afi, and B is perpendicular to dl along fiy and 5a. Using (8.42), 
cfB-dl = By(z)(—l) = noJola- The reason for (—£) in the line integral is that 
the integral from y to 8 is in the — j direction. The total current passing through 
afiySa is Jola\ there is no current for z > a. The result is B y (z ) = —ixoJoa. 

The field is constant above the slab: B = —/roToaj, no matter how large 
z is, as long as z > a. We can also obtain this result by integrating the con¬ 
tributions of infinitesimal current sheets making up the finite slab. By (8.46), 
d B = —jiM)dK j, where dK = Jodz:, the integral over — a < z < a gives 
again B = — noJoa j. Similarly, considering the region z < —a below the slab, 
one finds there B = +noJoa j . 

To find B in the region 0 < z < a, inside the slab and above the mid¬ 
plane, apply (8.42) to the loop a' fi'y'S'a', of dimensions l x z with z < a. The 
only nonzero contribution to § B • dl is from the segment y'S', so j B • dl = 
By(z)(—l) = iM)Jolz.. Because the loop is traversed counterclockwise, the right- 
hand convention requires that current in the +x direction (i.e., out of the page) be 
positive. Hence for z > Owe wrote the current as JqIz, which is proportional to 
the height z. The result is B y (z) = —noJoz . Similarly, for the region — a < z < 0, 
the field is B y (z) = —fxoJoz', notice that z is negative in this region so B points in 
the positive y direction. 

Figure 8.14(b) is a plot of B y (z) versus z. Note that B y is continuous at the 
interfaces z = ±a because there is no surface current there. Recall from Sec. 
7.2 what is meant by surface current: There must be finite current on the sur¬ 
face S through a rectangle of infinitesimal height dz perpendicular to S. (The 
current through a unit length of the rectangle due to the volume current is only 
the infinitesimal Jodz-) For a surface with zero surface current, the tangential 
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components of B are continuous across the surface. On the other hand, B y (z) is 
discontinuous at z = 0 in Example 4; in general the tangential component of B 
has a discontinuity at a current-carrying surface, proportional to K. 


EXAMPLE 6 What is the field due to a long, densely wound cylindrical solenoid 
with circular cross section? 

A solenoid is a coil of wire carrying current I wound around a cylinder. If the 
coil is densely wound then the current is approximately the same as a current sheet 
rolled up into a cylinder, as illustrated in Fig. 8.15. (The surface current density is 
K = «/</>, where n is the number of turns per unit length.) We consider this ideal 
case. It would be difficult to find B directly from the Biot-Savart integral, but it is 
easy by Ampere’s law. 

By symmetry, treating the solenoid as infinitely long as well as densely wound, 
B(x) must point in the direction k of the solenoid axis, and cannot depend on 
the position z along the axis, nor on the azimuthal angle <p around the z axis. 
That is, in cylindrical coordinates, B(x) = B(r)k. First consider a rectangular 
Amperian loop C i outside the solenoid; see Fig. 8.15. The loop integral taken 
in the counterclockwise direction around C\ is ) — B(r 2 )] i, and this is 0 
by the circuital law because there is no current enclosed by C\. Thus B(r) is 
constant outside the solenoid. Furthermore, the constant must be 0 because B is 0 
at r = oo. B must be 0 at oo because it is the field due to a finite current I. We 
conclude that 


B = 0 outside. (8.47) 



FIGURE 8.15 Example 6. The magnetic field of a long solenoid can be calculated from 
Ampere’s Law. C\ and Cj are Amperian loops used in the analysis. The figure shows a 
finite solenoid for convenience but Example 6 considers the ideal case of a closely wound 
solenoid of infinite length. 
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This result is for an ideal solenoid. For a real solenoid with one layer of wire 
wound on a cylinder, there is a component of current in the axial direction, so that 
an Amperian loop surrounding the solenoid (coaxial with the cylinder) encloses 
the current I. Then B has a small azimuthal component outside the solenoid, 
B(p(r) ~ iM)I /(2nr) for a circular solenoid. This external field would be very 
weak compared to the field inside the solenoid. 

Next consider a rectangular Amperian loop C 2 cutting through the solenoid 
surface. The loop integral around C 2 is B{r)l\ this is equal to nonll by the cir¬ 
cuital law, where n is the number of turns per unit length, because nl I is the total 
current enclosed by C 2 . Thus we have 

B = fiQtilk inside. (8.48) 

Note that K = nltfi is the surface current density, so B — tu)K inside the 
solenoid. Remarkably, B is constant in the solenoid —independent of r. 

We derived (8.47) and (8.48) for a solenoid with circular cross section, and can 
show that the same equations hold for a square cross section, or indeed any cross 
sectional shape, assuming the shape is constant in the z direction. 10 To prove the 
latter result, build up the generalized cylinder from infinitesimal square solenoids 
inside it in such a way that all currents cancel except those on the outer surface. 


Example 6 is an infinite solenoid—an idealization that approximates a long 
solenoid (length radius) far from the ends. But real solenoids do not all satisfy 
these simplifying assumptions. The field on the axis of a densely wound solenoid 
of finite length can be calculated 11 by integrating dB of the circular coils that 
compose the solenoid; we know dB from Example 2. Figure 8.16 is a plot of 


solenoid length 


B(z)/B 0 



z/a 

FIGURE 8.16 Magnetic field of a finite solenoid. The field is plotted along the axis of 
a circular cylindrical solenoid for which the ratio of length to diameter is 4. B(z) is plotted 
in units of Sq, the field of an infinite solenoid with the same surface current. 

10 Equations (8.47) and (8.48) are correct for any cross sectional shape because they satisfy the field 
equations and the boundary conditions. See Chapter 9. 

1 *See Exercise 13. 
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FIGURE 8.17 A loosely wound solenoid. 


B Z {Q, 0, z)/Bo for a solenoid of radius a centered on the z axis, extending from 
z = —4 a to z = 4a; the reference field Bo is the field nonl of an infinite solenoid. 

Figure 8.17 shows the magnetic field for a solenoid that is loosely wound 
and finite in length, i.e., a helical wire. The reader should imagine how the ideal 
solenoid emerges as the limit of very closely wound coils. 

Solenoids are commonly used for electromechanical relay switches. Current 
produces a magnetic field that pulls a ferromagnetic plunger into the solenoid, 
closing the switch. The starter solenoid of a car works this way; a small current 
supplied through the ignition switch to a solenoidal coil closes a circuit through 
which the battery supplies a much larger current to the starter motor. 

Examples 3, 4, and 6 are summarized in Fig. 8.18. Note in each case how B 
curls around I . 

8.4.2 ■ Formal Proof of Ampere's Law 

We have seen how useful Ampere’s law can be. But how is it related to the Biot- 
Savart law? We will construct a mathematical proof of Ampere’s law, starting 
from the Biot-Savart law. The differential and integral forms of Ampere’s law are 


straight wire current sheet 


solenoid 



FIGURE 8.18 Fields and currents of Examples 3, 4, and 6. Note in each case how the 
field curls around the current. The ideal analytical solutions require translation invariance 
in each case. 
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FIGURE 8.19 Any finite current distribution may be subdivided into thin loops. 


equivalent, so we may prove either one. We’ll prove the integral relation (8.42). 
The mathematics of the proof is difficult but beautiful. 12 

The starting point is the Biot-Savart law (8.21), which is an experimental fact. 
Now, it is sufficient to consider as the source of B a current I on a closed curve, 
because any finite current distribution can be subdivided into thin loops, as illus¬ 
trated in Fig. 8.19. By the superposition principle, if Ampere’s law is true for one 
loop then it is true for any superposition of loops. 

Now consider Fig. 8.20(a). Let C' be the closed curve carrying current /. For 
simplicity we assume that C is planar, but the proof can be generalized to a non- 
planar current loop. For an Amperian loop C (see Fig. 8.20(a)) we will evaluate 
formally the circulation integral of B, defined as 

y = (^B-dx = J]B-<5x (8.49) 

5x 


SX y/ 




(a) 

FIGURE 8.20 Proof of Ampere’s Law. (a) C' carries current / and C is an Amperian 
loop, (b) — Sx x dx! is an area element of the surface that would be swept out if C' were to 
be displaced by — Sx. 

12 J. C. Slater and N. H. Frank, Electromagnetism (Dover, New York, 1969), Chapter 5. 
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where the limit |<5x| —> 0 is understood. We must prove that y = ixqI if C and C' 
are linked, but y = 0 if C and C' are not linked. 

By the Biot-Savart law, 


B(x) 


4n Jc 


dx! x r 


(8.50) 


where r = x — x' is the vector from the source point x' on C' to the field point x 
on C. The contribution to y from the small segment <5x of C is 

Sy = BL I = 1 (8.5!) 

4 n Jc 1 r 2 4;r Jc r 2 

where the second form follows from a vector cross product identity. Now, — S\ x 
dx' is a certain area element, and we will evaluate the integral by analytic geom¬ 
etry. 

Let C^ is denote the loop that results from displacing C' by — <5x, as shown 
in Fig. 8.20(b). Also, let 81Z denote the surface between C' and C^ is (a loop of 
ribbon). 8'JZ is the surface that is swept out by C' as it is translated by — 8x. Figure 
8.20(b) demonstrates that — 8x x dx' is an area element dA of the ribbon 811, 
shown as a shaded region in the figure. The key to this proof of Ampere’s law is 
to see that — f • dA/r 2 is the solid angle subtended by dA viewed from x (— f is 
the unit vector pointing from x to x'). Therefore the integral around C’ in (8.51) 
gives the net solid angle <5S2 subtended by the ribbon loop 811, 


l 


(<5x x dx!) • r 


= 8Q. 


(8.52) 


But (5S2 is just the difference between the solid angles and subtended, 
respectively, by the planar surfaces bounded by C’ and C^ is ; that is, <5S2 = L2' — 
f2j is . As C’ moves away from x (by — <5x) the solid angle decreases; that is, £2j is < 
fT. In other words, letting S' denote the planar surface bounded by C', 8H is the 
decrease of the solid angle subtended by S' either when C’ moves by — 8x with 
x fixed, or when x moves by +<5x with C' fixed. The sign convention here is that 
Q' is positive viewed from a point above S', as in Fig. 8.20(b), and negative when 
viewed from a point below S'; equivalently, the direction of an area element dA' 
on S' points downward. 

Now we can evaluate the circulation integral y in (8.49). Combining (8.51) 
and (8.52) we have 


y = 


X> = £ 

<5x 5x 


47r 


[tt'(x) 


Q'(x -(- 5x)], 


(8.53) 


where f2'(x) and L2'(x + <5x) are the solid angles subtended by S' viewed from x 
and x + <5x, respectively. Adding all the small differences of solid angle as x steps 
around the loop C yields the net change in the solid angle subtended by S', so 
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Y = (-Atf) , (8.54) 

4 n 

where AQ' is the net change f^ nal — Q |nilia| from an initial point on C infinitesi¬ 
mally above S' to a final point on C infinitesimally below S'. 

If C threads through C ', as in Fig. 8.20(a), then A £2' = —47r: Imagine travers¬ 
ing C starting from a point infinitesimally above the planar surface S' bounded 
by C'. The initial solid angle subtended by S' is 2jt. As x moves around C, ST 
decreases, becomes 0 at some point, and then negative. As x approaches the final 
point infinitesimally below S', ST approaches —2jt. The net change is AST = 
—4tr. Hence Ampere’s law is proven, because y = p,$I and the current I is 
enclosed by C. 

If C does not thread through C' then the initial and final solid angles, subtended 
by C' at points infinitesimally above and below the plane of C' respectively, are 
both 0. In this case AST = 0 so the circulation y is 0, in accord with Ampere’s 
law because zero current is enclosed by C. 

The integral form of Ampere’s law is thus proven. The proof is formal rather 
than practical. It shows that Ampere’s law and the Biot-Savart law are not inde¬ 
pendent principles, but equivalent. 


8.5 ■ THE VECTOR POTENTIAL 

At this point it is useful to compare the principles of magnetostatics and electro¬ 
statics, given in the table below. 


Magnetostatics 

Electrostatics 

V x B = ju-oJ 

V • E = p/e 0 

V • B = 0 

V xE = 0 

B curls around 7 

E diverges from q 


Note this difference: The electric field has scalar sources, but the magnetic 
field has vector sources. Electric charge, i.e., a point source of E from which 
E diverges, is a common property of elementary particles. Motion of charged 
particles—current—is the source of B, around which B curls. 

Magnetic charge, i.e., a point source from which B diverges in the rest frame of 
the particle, apparently does not exist in nature, or at least it has not been observed. 
Such a hypothetical charge is called a magnetic monopole. Dirac showed that it is 
possible to construct a consistent quantum theory with both electric charges and 
magnetic monopoles. However, the fundamental magnetic charge g and electric 
charge e would necessarily be quantized, and satisfy the relation eg = n/2, where 
n is an integer. Many experimenters have searched for magnetic monopoles, but 
so far the results are negative. Some speculative theories of high-energy physics, 
such as grand unified field theories, predict the existence of very massive mag¬ 
netic monopoles, too massive to be produced at current high-energy accelerators, 
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but which might have been produced in the big bang. Searching for magnetic 
monopoles continues to be an interesting experimental challenge. 

If magnetic monopoles do not exist, then the equation V • B = 0 is a universal 
equation of magnetism. Whether magnetic monopoles exist or not, the source of 
the magnetic fields we encounter in physics are not point magnetic charges but 
rather currents of electric charge, corresponding to the source equation V x B = 
MoJ- 

We found in electrostatics that it is useful to introduce a scalar potential V (x) 
for the electrostatic field, such that E = — VV. This guarantees that V x E = 0. 
In an analogous way we may introduce a vector potential A(x) for the magnetic 
field, such that 


B = V x A. (8.55) 

This guarantees that V • B = 0. Indeed, we have already seen B(x) written as a 
curl in (8.41). 

However, (8.55) does not uniquely determine A for a given magnetic field B: If 
/(x) is an arbitrary scalar function, then A + V/ has the same curl as A (namely 
B) because V x V/ is identically 0. Therefore we may impose a condition on A, 
called a gauge condition, to remove this ambiguity. The Coulomb gauge condi¬ 
tion, which we will use in magnetostatics, is 

V • A = 0. (8.56) 

Taking (8.55) and (8.56) together is still not enough to make A(x) unique, be¬ 
cause adding a constant does not change either V x A or V • A. But imposing 
an appropriate boundary condition, such as requiring A —» 0 at infinity, makes A 
unique. 

In Sec. 8.5.1 we will find an integral formula for the potential A(x) of localized 
current sources. But first, as a simple example of the vector potential, consider 
the uniform field B = Z?ok. A vector potential function, satisfying (8.55) for the 
uniform field and the Coulomb gauge condition (8.56), is 

A(x) = jB x x = jBq yi +xj^ . (8.57) 

An example of uniform B is the field inside a solenoid, so for B = Z?ok we may 
picture a long, tightly and uniformly wound solenoid, whose axis is the z axis. 
Note for this case that A(x) is parallel to the surface currents (azimuthal) and that 
A(x) curls around the B field. 


8.5.1 ■ General Solution for A(x) 
By Ampere’s law we have 


V x (V x A) = MoJ- 


(8.58) 
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The double cross product can be reduced to 

V x (V x A) = V(V • A) - V 2 A = -V 2 A, (8.59) 


the last equality by the Coulomb gauge condition (8.56). Now (8.58) becomes 
—V 2 A = MoJ, which is Poisson’s equation for each Cartesian component of 
A(x). Recall from Chapter 3 how Poisson’s equation can be solved with the 
Green’s function of —V 2 , which is l/47r|x — x'|. The solution is 13 


A(x) = 


Mo f J (\')d 3 x' 
4 j t J |x — x'| 


(8.60) 


For self-consistency we must have V • A = 0, which is true because V • J = 0. 14 
(Show V • A = 0 by a trick: V acts on the x in |x — x'|. But V on a function of 
x — x' is the same as —V'. In the latter form integrate by parts; the surface term is 
0 because J = 0 at infinity, and the other term is proportional to V' • J(x'), also 0. 
Exercise 21 supplies the details.) 

The vector potential in (8.60) is the same as in (8.41), the result obtained earlier 
from the Biot-Savart law. In other words, we obtain the same field from the Biot- 
Savart law and from Ampere’s law. This provides another proof that Ampere’s law 
is equivalent to the Biot-Savart law. We have written A in the Coulomb gauge, but 
the equivalence of Ampere’s law and the Biot-Savart law is independent of gauge 
choice because these laws involve only the gauge-invariant quantities B and J. 

Equation (8.60) is an example of the superposition principle. The small current 
source J(x')d^x' contributes its part to the potential, dA(x)\ the full potential is 
A(x) = f dA(x). Note from (8.60) that the contribution dA(x) from any current 
element Mx'jd^x' is parallel to the current element. 


EXAMPLE 7 Determine the vector potential for the magnetic field produced by a 
hollow charged sphere that rotates with constant angular velocity u) = oik. The 
sphere is shown in Fig. 8.21. 

Let a be the surface charge density, and a the radius of the sphere. The surface 
current density at a point x' on the sphere is K(x') = ctv(x'), where v(x') is the 
velocity at the point x'. The direction of v(x') is azimuthal and the magnitude is 
wa sin 6, where 6 is the angle between k and x'. Therefore v(x') = cok x x', and 
the surface current density is K(x') = cru) x x'. 

Now calculate A(x) using (8.60). Divide the surface into small patches dA' = 
a 2 d£2', and replace the current element J(x')d^x' appropriate for a volume by 
K(x')dA' for the surface; that is, 

1 Equation (8.60) is safe in a Cartesian coordinate system, where the direction vectors are constants. 
In spherical or cylindrical coordinates (8.60) may be misunderstood, because the directions of the 
basis vectors in the integrand depend on the point x'. which varies over the integral, and not on x. For 
example, to find A r (x) = r • A(x) we need more than J r (x'). 

14 See Exercise 21. 



8.5 


The Vector Potential 


283 


z axis 



FIGURE 8.21 Example 7. A hollow sphere with surface charge density a rotates with 
angular velocity u> = cok. What is the magnetic field? The point x' is a source point and 
the point x is a field point. 


where 


A(x) = 


Mo f K (x')dA' 
4it J |x - x'| 


Mo era 2 
47r 


u> x f(x), 


f(x) = 


/ 


x'dQ' 
lx - x'| ’ 


(8.61) 


(8.62) 


Again, x' is a source point on the sphere with x'| = a, and x is an arbitrary field 
point with |x| = r. The integral is over all points on the sphere, and we have used 
dA' = a 2 dQ' to express it as an integral over all solid angles. 

To evaluate the integral for f(x) requires some clever tricks. First, f(x) is a 
vector, depending only on x, so it must be Cx where C is a scalar. (Since f depends 
only on x, and there is no preferred direction in (8.62), what other vector could it 
be?) But then x • f = x • Cx = Cr 2 , so the scalar C may be evaluated as x • f /r 2 , 
i.e., 


x ■ f_ 1 f x • x'dQ 1 

r 2 r 2 J |x -x'| ' 


(8.63) 


Next, let P be the angle between x and x', as in Fig. 8.21, so that x x' = ra cos/l; 
and write C as an integral over ft 

a C n cos p 2 tt sin fid(3 

C = - / - r - (8.64) 

r Jo s/r 2 + a 2 — 2 ra cos fi 


using the fact that 2n smfidji is the solid angle d£l' of a strip of the unit sphere 
around the axis defined by x. Finally, change the variable of integration from 
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to u = cos P, and evaluate the integral over u with an integral table, or with an 
analytic computer program. But be careful ; the result is different for r < a and 
r > a: 



udu 

Vr 2 + a 2 — 2 rau 



(8.65) 


where r< is the smaller of r and a, and r> is the larger of r and a. The final re¬ 
sult is 


uo era 

A(x) = —-— u> x x if r < a 
uncra 4 

A(x) = - 5— u> x x if r > a. 

ir 5 


( 8 . 66 ) 

(8.67) 


Now calculate B(x) = V x A, as an exercise in vector calculus, and sketch a 
graph of the field lines. 15 The field has a very simple form, as one might expect 
from such a simple current source: Inside the sphere B is uniform and in the 
direction of u>, because A(x) has the same form as (8.57). Outside the sphere B 
is a pure dipole field, i.e., the same as if there were a point magnetic dipole at the 
origin—a field configuration that we study next. 


8.6 ■ THE MAGNETIC DIPOLE 


What is the asymptotic magnetic field far from a static current distribution of 
finite extent? In Chapter 3 we addressed the analogous question for electrostatics, 
and derived the multipole expansion for the electric potential. We now construct 
the similar expansion for the vector potential. For an arbitrary current distribution 
the multipole expansion of A(x) has in order dipole, quadrupole, octopole terms, 
etc.; there is no monopole term. We will derive only the dipole term, which is the 
dominant term at large distances unless the dipole moment happens to be 0. 


8.6.1 ■ Asymptotic Analysis 

The vector potential A(x) is given by (8.60). The source J(x') is assumed to be 
localized in a limited region of space. Far from the source, i.e., for r»r' where 
r = |x| and r' = |x'|, the following approximation is valid: 


1 


Ix-x'l 



( 8 . 68 ) 


We used the same expansion in Eq. (3.61) of Chapter 3, when deriving the asymp¬ 
totic V (x) of a static charge distribution. In (8.68) r is x/r , the unit vector parallel 
to x. We will neglect the last term in (8.68), i.e., terms that decrease as r -3 or 
faster as r increases; these are quadrupole and higher multipole terms. 

15 


See Exercise 22. 
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We will first show that the first term on the right-hand side of (8.68) gives no 
contribution to the vector potential. If we substitute that term into (8.60), the result 
is proportional to the integral of J over all space, which we will prove is 0, 

J J (xVV = 0. (8.69) 

How to prove this simple equation is not so obvious! Note that 

Ji(x) = V.(*iJ)-ZiV-J. (8.70) 

(We temporarily drop the prime on the source point to simplify the notation.) But 
V • J = 0 for magnetostatics, by the continuity equation, so the second term on 
the right side of (8.70) vanishes. And the integral of the first term V • (x,J) is 0 
by Gauss’s theorem, because x,J is 0 on the sphere at infinity; J has finite extent. 
Hence (8.69). 

The asymptotic potential is obtained by substituting the second term of (8.68) 
into (8.60) 


A(x) = -J—j f J(x')r ■\'d 3 x'. (8.71) 

47T r 2 J 

We will show that the integral in (8.71) can be reexpressed as 

J J(x')f • x'd 3 x' = m x f (8.72) 

where m, called the magnetic dipole moment of the current distribution, is de¬ 
fined by 


m= lyV x J(xVV- (8.73) 

The dipole moment m is an intrinsic property of the current distribution. That is, 
m does not depend on the observation point x. 

To derive (8.72) requires some real ingenuity! First note that 16 

J V • (xiXjj)d 3 x = j (xiJj +XjJj)d 3 x = 0. (8.74) 

The first equality is a vector identity, using the fact that V • J = 0 for magneto¬ 
statics. The second equality is true because the far left-hand side is 0 by Gauss’s 
theorem, since x,x y J is 0 on the surface at infinity; J has limited extent. 17 Now, 
(8.74) says that / XiJjd 3 x is antisymmetric in ij; therefore we have 

16 As usual the indices j , k take the values 1, 2, 3 corresponding to Cartesian coordinates x, y, z. 
Also, we temporarily drop the prime on the source point to simplify the notation. 

17 A mathematician would say “J has compact support.” In other words, the region of space where 
J yt 0 does not extend to infinity. 
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J (xiJj - XjJ^d^x = 2 j xtJjd^x = €ijk /(xx J ) k d 3 x 

where k is summed from 1 to 3, and ey* is the Levi-Civita tensor. We may then 
write 

J XiJjd 3 x = €ijkm k . (8.75) 

Equation (8.72) now follows: Restoring the prime on the integration variable, i.e., 
the source point, multiplying both sides of the equation by (r),-, and summing 
over i, 


J Jj(x')x' • rd V = €jjkm k r, = (m x r) ; .. (8.76) 

Equation (8.76) is (8.72). 

The dipole vector potential is, from (8.71) and (8.72), 


. „ . Mo m x r mo m x x 

A(x) = - 5 — = -—t-. 

4ji r- 4 7i |x | 3 


(8.77) 


This is the asymptotic vector potential of a finite current distribution, assuming 
m is not zero. Equation (8.77) is also the vector potential of a point-like mag¬ 
netic dipole. What we mean by point-like is that the spatial extent of the current 
distribution is much smaller than any other relevant lengths in the system. The 
magnetic field due to the dipole is 

an [3r (m • r) - ml 

B(x) = V x A = — i— - J- -± (8.78) 

4n r 3 

Note that B(x) for a point magnetic dipole has exactly the same pattern of field 
lines as E(x) for a point electric dipole (3.72). 

In spherical polar coordinates the vector potential and magnetic field of a point¬ 
like magnetic dipole m = mk, located at the origin and pointing in the z direction, 
are 


A(x) = 


pom sin(9 
4nr- 




B(x) 


MO m 
471 ^ 


2 r cos 6 + 0 sin 0 


]• 


(8.79) 

(8.80) 


8.6.2 ■ Dipole Moment of a Planar Loop 

As an example of a finite current distribution, consider a planar current loop C 
with current I. We may picture a wire loop carrying the current, as in Fig. 8.22. 
What is the dipole moment? The shape of the loop is irrelevant as long as it is 
planar. Recall that J(x , )rf 3 x / is the same as I di for current in a wire, where di 
is a loop segment at x' as shown in Fig. 8.22. With this replacement in (8.73), the 
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FIGURE 8.22 A planar current loop. The magnetic moment is m = I An because 
f x di = Ah. The origin 0 is arbitrary. 


dipole moment of a wire loop is 

m=I<j> l -x' xdi. (8.81) 

The vector ^x' x di is perpendicular to the plane of the loop, and its magnitude 
is equal to the area of the infinitesimal triangular sector bounded by x', x' + di, 
and di; that is, ix' x di is an infinitesimal area element dA of the loop. Thus 

m = I An (8.82) 

where n is the normal unit vector, and A is the area of the loop. The dipole moment 
points perpendicular to the loop, with magnitude I A. 

8.6.3 ■ Torque and Potential Energy of a Magnetic Dipole 

Up to now we have considered the field created by a magnetic dipole. Now we 
turn to the dynamics of a dipole. What is the force or torque on a dipole in an 
applied field B? (It is important to understand that B is an external field applied 
to the dipole, not the field produced by the dipole.) We’ll answer this question by 
considering the special case of a planar current loop, although the result we will 
obtain holds in general. 

The torque on a planar current loop, due to the magnetic force on the current 
distribution, is 

N = ^xxdF = ^xx (Idx x B), (8.83) 

where dx is an infinitesimal segment of the curve at x. The equation simplifies if 
B is uniform in the region of the current, by some cross product identities. First 
note that 


d [x x (x x B)] = x x (dx x B) + dx x (x x B) (8.84) 

assuming B is constant. The loop integral of a perfect differential is 0, because for 
a loop the endpoints of integration are the same point, so the loop integrals of the 
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two terms on the right side are equal but opposite. Therefore, we may write N as 

N = - (f> [x x ( dx x B) — dx x (x x B)] (8.85) 

2 Jc 

= - (b [(x X dx) X B] (8.86) 

2 Jc 

where the second equality is an example of the cross-product identity 

a x (b x c) + b x (c x a) + c x (a x b) = 0. (8.87) 

Again assuming that B is constant, we may pull xB out of the integral. (To jus¬ 

tify this step it may be helpful to express ( 8 . 86 ) in Cartesian coordinates.) The 
remaining integral is m by (8.81); hence 

N = m x B. (8.88) 

The result ( 8 . 88 ) may be applied generally to any point-like magnetic dipole. The 
torque on a magnetic dipole is in the direction twisting the dipole moment m 
toward alignment with the field B; that is, the torque is perpendicular to B. 

Because we have derived ( 8 . 88 ) for an arbitrary planar loop the calculation is 
rather intricate. To gain some intuition, study Fig. 8.23, which shows specifically 
a square loop. 

The orientational potential energy associated with the torque ( 8 . 88 ) is given by 

U = -m B. (8.89) 



FIGURE 8.23 A square current loop. The magnetic forces on currents in the top and 
bottom sides are equal but opposite, making a torque m x B on the loop, pointing out of 
the page. 
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m + dm 



m 

FIGURE 8.24 Rotation of m. The change of m due to the rotation dO is dm = dO x m. 


This result can be derived by a purely mechanical argument. Assume the internal 
currents that make m are not changed by rotation of m, so that the magnitude |m| 
is constant. The potential energy U is defined by the statement that the work done 
by the torque, during an infinitesimal rotation of the dipole, is — d U. The work 
is N • dd, where dO is rotation; the direction of dO is the axis of rotation and the 
magnitude is the angle. The change of m is dm = dB x m, by Fig. 8.24, so 

dU = -N • dO = -(m x B) • d9 = — (d6 x m) • B = -dm ■ B; 


hence (8.89). 

If the magnetic field varies with x, then there is also a translational force on a 
point-like dipole, given by 

F = — V(7 = V (m • B(x)) . (8.90) 

The force is 0 if B is uniform, just as the force on an electric dipole is 0 if E is 
uniform. 

If a magnetized needle is put through a cork floating on the surface of water, the 
needle becomes aligned with the Earth’s magnetic field, pointing north, because 
of the torque (8.88). However, the cork does not move toward the north, because 
for that to happen there would have to be a gradient in the Earth’s field, as shown 
by (8.90). In fact the Earth’s field does have a gradient, but it is much too small to 
affect the cork. 

A famous example of a translational force on a magnetic dipole due to a 
non uniform B(x) is the Stem-Gerlach experiment. A beam of neutral silver atoms 
is sent through a region between the poles of a magnet. The pole shapes are asym¬ 
metric, designed to create a gradient of B. The beam splits into two parts, because 
some Ag atoms have their dipole moments parallel to B while the others are an¬ 
tiparallel. The force (8.90) deflects the two groups of atoms in opposite directions. 
This experiment led to the discovery of electron spin, which contributes to the 
magnetic moment of an atom. 

It is interesting to compare the torque on a magnetic dipole m in B, to the 
torque on an electric dipole p in E. The comparison is illustrated in Fig. 8.25. The 
electric dipole consists of equal but opposite charges, which are forced in opposite 
directions along E. The magnetic dipole may be visualized as a small square loop 
of current, opposite sides of which are forced in opposite directions perpendicular 
to B. In either case the torque is perpendicular to the field, twisting the dipole 
toward alignment with the field. 
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FIGURE 8.25 Torques on electric and magnetic dipoles. Where, and in what directions, 
are the forces in each case? 


A notable difference between p and m. It is interesting to compare (8.89) with 
the analogous potential energy U = — p • E, equation (3.78), of an electric dipole 
p in an electric field E. The results are analogous because simply interchanging m 
with p and B with E interchanges the magnetic and electric expressions. But there 
is a fundamental difference between the two cases which is interesting to explore. 
Equation (8.89) is an orientational potential energy, equal to the work that must 
be done to turn m from the orientation with zero energy (m perpendicular to B) to 
any other orientation, keeping |m|, |B|, and x fixed. Equation (3.78) is something 
more general, because it is equal to the total work that must be done to bring the 
dipole from infinity, where the potential energy is taken to be 0, to the position x. 
The derivation of (3.78) depended on the existence of monopole charges +q and 
— q. The magnetic case cannot be treated in the same way because there are no 
magnetic monopoles. 

What then is the total potential energy U t ota i required to bring m from infinity 
to x? To consider this, it is helpful to think of m as a current loop, and of the 
source of B as current-carrying coils. A proper calculation of {/total must take 
into account the work done to keep the loop and coil currents constant against the 
forces associated with moving the coil through the field. 18,19 The rather surprising 
result of this analysis is that {/total = +m ■ B. To analyze the complete energetics 
of the system it is {/total that is the relevant potential energy. However, to analyze 
the motion of the dipole itself, assuming the currents that create m and B are 
constant in magnitude, it is correct to calculate forces and torques with (8.88), 
(8.89), and (8.90). 

Magnetic dipole dynamics. The rotational motion of a dipole under the influ¬ 
ence of the magnetic torque is determined by the equation of motion dL/dt = 
m x B, where L is the angular momentum of the object. Rotational dynamics is 
diverse, and sometimes nonintuitive. The rotational motion of the dipole depends 
on the relation between m and L. If m is constant along a body-fixed axis, as in 

18 77ie Feynman Lectures, Vol. II, Secs. 15-1 and 15-2. 

19 G. H. Goedecke and R. C. Wood, Magnetic dipole orientation energies, Am. J. Phys. 67, 45 (1999). 
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the case of a magnetized needle, then one class of rotational motion is oscillation 
about the equilibrium. The equilibrium is m aligned with B. An example of this 
case is the oscillation of a compass needle about the north direction. 20 For the 
oscillating needle, m and L are perpendicular; m is in the plane of the compass 
and L is along the pivot axis. On the other hand, if m is proportional to L, so 
that these vectors are parallel or antiparallel (as is the case for electrons, protons, 
neutrons, and other elementary particles) then the motion is precession about B, 
analogous to precession of a top. 

The interaction energy (8.89) is important in nuclear magnetic resonance 
(NMR). The dipole moment of an atomic nucleus is proportional to the angular 
momentum, which is quantized. Therefore the energy of the nucleus in an ap¬ 
plied magnetic field is quantized. NMR is the absorption of radiation by quantum 
transitions between the magnetic energy levels. NMR is the basis for magnetic 
resonance imaging (MRI) widely used in medical applications. 

8.6.4 ■ The Magnetic Field of the Earth 

Understanding the Earth’s magnetic field is one of the oldest problems in science, 
and it is still not completely solved. The sources of the field are self-generated 
electric currents in the Earth’s outer core. The outer core is a conducting liquid 
with resistivity p ^ 1.7 x 10 -6 £'2m, made of molten Fe and Ni under high pres¬ 
sure, extending in radius from about 0.2J?Earth to 0.55J?Earth- Ganymede, a moon 
of Jupiter and the largest moon in the solar system, also has a magnetic field due 
to currents in a molten core. From radiometric dating of magnetic rocks there is 
evidence thatthe Earth’s magnetic field is changing slowly and has been changing 
for the last 160 My or longer. The magnetic poles at the surface wander around 
the Earth’s rotation axis on a long time scale and the polarity has reversed many 
times, on a time scale of about 200 ky, over millions of years. 

At the surface of the Earth the magnetic field is approximately equal to a dipole 
field whose magnetic moment is /nEarth = 7.79 x 10 22 Am 2 . The field at the 
surface is predominantly the asymptotic dipole field, but accurate measurements 
reveal the small multipole components. We can deduce the field inside the Earth 
from field measurements at the Earth’s surface, at least to the depth of the source 
currents, from the source-free field equations V • B = 0 and V x B = 0. The form 
of the magnetic field near its source is very different from a dipole field, and is 
. quite complex. 21 


8.7 ■ THE FULL FIELD OF A CURRENT LOOP 

In this chapter we have considered a circular loop of current several times. In 
Example 2 we found the magnetic field on the axis of the loop. In Sec. 8.6.1 we 

20 See Exercise 33. 

21 D. R. Stump and G. L. Pollack, A current sheet model for the Earth’s magnetic field, Am. J. Phys. 
66, 802(1998). 
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found the vector potential of a magnetic dipole, which may be thought of as a 
“point” current loop. In Sec. 8.6.2 we found the dipole moment of a loop. It is 
natural to discuss the circular loop, because it is highly symmetric and, in some 
sense, the most elementary source of a magnetic field. 

It is also natural to ask: Can we find th e full field, everywhere, for a circular 
loop of current, by using the analytical techniques developed in this chapter? The 
answer is yes, but the analysis is somewhat more challenging than the problems 
we have done so far, and the result is left in terms of elliptic integrals—tabulated 
functions less familiar than algebraic or trigonometric functions. In this section we 
find that magnetic field, as an illustration of the power and limitations of analytical 
methods, and their connection to numerical methods. 

Let the loop lie in the xy plane, have radius R, and carry current I , as shown in 
Fig. 8.26. We know from the symmetry of the problem that the magnitude of B(x) 
cannot depend on the aximuthal angle 4>. For convenience we choose to consider 
a field point P in the yz plane, with coordinates (0, y, z). The position vector of 
P is 


x = y j + zk. (8.91) 

An arbitrary source point has coordinates (x', y', z') = (R cosip ', R sin^', 0), so 
that its position vector is 


x' = R cos <p' i + R sin <p' j. (8.92) 

The current element I ill, shown in Fig. 8.26, is tangent to the loop at x', counter¬ 
clockwise in the xy plane, so the length element is 

dl = Rdip'sirup'i + cos ip'. (8.93) 


z 



FIGURE 8.26 Geometry of a current ring. The current element Idl is at x' in the xy 
plane. The field point P is at x in the yz plane. 
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The Magnetic Field 

We’ll first find B(x) from the Biot-Savart law (8.21), 


B(x) 


do 

4tr 


Idl x (x — x') 
|x-x'| 3 


The relative vector, and its magnitude, are 


(8.94) 


x — x' = —R cos 0' i + (y — R sin0') j + zk (8.95) 

|x — x'| = ^R 2 + y 2 + z 2 — 2 yR sin 0'. (8.96) 

The cross product in the numerator of (8.94) can be evaluated by expanding in the 
Cartesian unit vectors, leading to the result 


B(0, y, z ) = 


doIR f 27Z z cos 0' i + zsin0'j + (R - y sin0')k^, 
^ TC JO (/? 2 + y 2 + z 2 — 2yR sin0') 3 ^ 2 


(8.97) 


The x component of B in (8.97) can be evaluated explicitly. But before doing 
so, we can see that B x = 0 from symmetry, by examining Fig. 8.26: The con¬ 
tribution to B x at P due to the current element at 0' is canceled by that due to 
the current element at n — <p'. To evaluate B x explicitly, change the variable of 
integration from 0' to u = R 2 + y 2 + z 2 — 2y R sin cp'. That is, 


B x 


doiz r^ d 
4try J 0 / = o 



= 0 . 


(8.98) 


The integration is over an entire cycle of 4>', so u is the same at the two endpoints, 
and the integral is 0. The integrals for B y and B z in (8.97) can be expressed in 
terms of elliptic integrals, an analysis that can be found in advanced books, 22 but 
we won’t pursue that approach here. Alternatively, the integral over <p' can be 
evaluated numerically by computer. 23 

The symmetry of this problem suggests that it is natural to rewrite the field in 
terms of spherical components B r and Be, and the spherical coordinates r and 0 
of P, rather than Cartesian components and coordinates. After all, we chose P to be 
in the yz plane only for convenience in calculation. We already know that B$ = 0, 
because in the yz plane the ^-component of B is parallel to the x axis. Thus by 
symmetry B has the form ? B r +0Be, where the components are independent of 0. 
To get B r (r, 6 ) and Be(r, 0) from the Cartesian components B v (y, z) and B z (y, z) 
at the point P, we make in (8.97) the substitutions y = r sin 0 and z = r cos 6. 
Also, the connections between the spherical and Cartesian components at P are 


22 Jackson; Smythe. 
23 See Exercise 41. 


B r = By sin 9 + B z cos 9, 


(8.99) 
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Be = By cos 9 — B z sin9; 


( 8 . 100 ) 


because in the yz plane, ? = j sin 9 + k cos 9 and 0 = j cos 9 — k sin 9. With 
(8.97) to (8.100) we have determined B in terms of integrals over <p'. The results 
in elliptic integrals are 


B r (r,0) 


Be(r, 9) = 


HoIR 2 cos 9 


4 7T 


r In 

Jo 


dtp' 


P-o IR 
4 7T 


r2n 

Jo 


( R 2 + r 2 — 2rR sin 0 sin 0 ') 
(r sin0' — R sin0) dcp' 


a 3/2 


[R 2 +r 2 — 2rR sin 9 sin tp') 


3/2 ■ 


( 8 . 101 ) 


( 8 . 102 ) 


The Vector Potential 

Another way to analyze the current loop of Fig. 8.26 is to find the vector potential 
A(x); then B = V x A. By (8.60) 


A(x) 


- f 
4 7T J 


d£ 


(8.103) 


Considering again the point P in the yz plane, the Cartesian representation of 
A(x) is 


A(x) = A*i + A y j 


palR r 2n 

47 t J 0 


— sin^'i +cos</>'j 
y/R 2 + y 2 + z 2 — 2yR sirup' 


dtp'. (8.104) 


Note that A y = 0, because the integral for A y has essentially the same form as 
the integral for B x that we found to be 0 in (8.98). We could have anticipated this 
result, because (8.103) shows that the contribution to the vector potential from a 
small current element is parallel to the current. The current in Fig. 8.26 runs in 
the rp direction, and contributions to Ay(0, y, z) from pairs of points on the ring 
cancel. 

At this point it is natural to use spherical coordinates. We have seen that the 
vector potential is azimuthal, so A(x) = A$(r, 9)<jr, and by converting from y and 
z in the yz plane to r and 9, 


A$(r, 9) = 


+/M)IR r 27T sin <p'd<p' 

Jo y/R 2 +r 2 -2rR sin 9 sin (p ' 


(8.105) 


This reduces A(x) to a finite integral. 

By calculating A(x), we gain something and lose something. The gain is that 
instead of the two integrals (8.101) for B r and (8.102) for Bn, we now have only 
the one integral (8.105) for A However, any of these integrals would have to 
be evaluated numerically. In the case of (8.97) we are getting the components of 
B itself, i.e., what we are interested in. In the case of (8.105) we must calculate 
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derivatives of A$ to obtain B. Furthermore, to calculate the derivatives numeri¬ 
cally, it would be necessary to evaluate A(x) on a fine mesh of points. The loss is 
that in a numerical computation it would be necessary to compute A(x) precisely. 


Expansion in Legendre Polynomials 

It is interesting that a further simplification can be applied to the integrals for B r , 
Be, and A^. They can be expanded in Legendre polynomials. We shall apply this 
technique to (8.105). 

In Sec. 5.2.3 we learned that l/|x — x'| is the generating function of Legen¬ 
dre polynomials. Equations (5.84a) and (5.84b) express this inverse distance as a 
series. Applying then (5.84a) to the integrand in (8.105) we may write 


LlolR 00 / R\ n f 2 ^ 

Aj,(r,6) = -V - / P n (s'm6 sincp') sin <p'd<p'. (8.106) 

) J o 

Equation (8.106) holds for the region r > R, i.e., beyond one ring radius from 
the center; for the region r < R one must use (5.84b). Equation (8.106) is the 
multipole expansion for a circular ring. The terms in the sum with n even are 0: 
P n (sin 0 sirup ') is a polynomial in sin <p' with only even powers if n is even; when 
multiplied by the factor sin cp' they integrate to 0 over the range from <p' = 0 to 
2tt. [The integral of any odd power of sin <p' over the cycle (0, 2 tt) is zero.] Thus 
the first nonzero term in the series is the n = 1 term, the dipole term; the series 
has only odd n’ s. 

The series in (8.106) converges rapidly unless R/r is close to 1. To compute B 
for R/r < 1 one can evaluate V x A analytically, keeping only the first few terms 
of the series for sufficient accuracy, from 

B r = 1 - -^(sinflA^) and B e = (r A^). (8.107) 

r sin0 30 ’ r dr ' 

Figure 8.27 shows the magnetic field of a loop of current, in a half-plane per¬ 
pendicular to the wire. Very close to the wire the lines of B circle around the 
current, like the field of a straight wire. Far from the loop, the term in (8.106) 
with n = 1 dominates the series, and A approaches the dipole vector potential 


A 0 % 


pom sin 0 
4-n r 2 


(8.108) 


where m = InR 2 . It is the intermediate fields, i.e., in the region where r is 
comparable to R, that are hard to calculate. This foreshadows what we will leam 
about the fields radiated by oscillating dipoles and antennas in Chapter 15. For that 
case, too, the near and intermediate fields are complicated, while the far fields are 
(relatively) simple. 
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FIGURE 8.27 Magnetic field lines of a current ring. The ring lies in the xy plane 
centered at the origin. Field lines in the yz plane are shown. 


EXERCISES 

Sec. 8.1. The Magnetic Field 

8.1. A Hall probe is a device for measuring magnetic fields. In a typical device a current 
I flows in a ribbon of n-type doped semiconductor located in the magnetic field. In 
an n-type doped semiconductor the charge carriers are electrons. In the steady state 
there is a voltage difference Vfj between the edges of the ribbon, such that the net 
perpendicular force on a charge carrier q1Z±_ + q(y x B)x is 0. From Vfj one can 
determine B. 

(a) Assume the geometry shown in Fig. 8.28, with dimensions 1.0 cm by 0.2 cm by 
0.005 cm. The material is As-doped Si, with 2 x 10 15 conduction electrons per 

1.0 cm 



FIGURE 8.28 Exercise 1. The Hall effect. The sign of Vfj depends on the sign of the 
mobile charges. The battery supplies the voltage difference that produces the current. 
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cm 3 , and resistivity 1.6 J2cm. The current is driven by a 3 V battery across the 
1.0 cm length. What Hall voltage will be measured across the 0.2 cm width if 
the field strength is 0.1 T? [Answer: Vfj = 11.7 mV] 

(b) Show that the sign of Vh depends on the sign of the charge carriers. Edwin 
H. Hall measured Vfj in metals in 1879, and found that the charge carriers are 
negative. 

Sec. 8.2. Applications of the Magnetic Force 

8.2. The experiment by which Thomson discovered the electron, consisted of a cathode 
ray passing between parallel capacitor plates in a uniform magnetic field. The elec¬ 
trons travel parallel to the plates and B is perpendicular to both E and v. Derive the 
condition relating the potential difference Vq between the plates and the magnetic 
field strength, along with any other relevant parameters, such that the cathode ray is 
undeflected, assuming the cathode ray is a beam of electrons. (This is the principle 
of the velocity selector in a mass spectrometer.) 

8.3. Consider a proton with kinetic energy 10 MeV in a cyclotron with magnetic field 1 
T. Calculate the cyclotron frequency and the radius of the orbit. 

8.4. Quadrupole focusing. A magnetic quadrupole field can be used as a focusing field 
for a charged particle beam. The cross section of the pole faces is shown in Fig. 
8.29. The pole faces are hyperbolas of the form xy = constant. There are two north 
poles and two south poles, marked on the figure. The dimension of the magnet per¬ 
pendicular to the cross section is l. The magnetic field in the region 0 < z < i 
is 

B (x,y,z) =b(yi + xj) 

in which b > 0; the field is 0 for z < 0 and z > l. Particles enter from negative z 
with velocity v 0 = u()k and are deflected by the force F = q\Q x B. (Neglect the 
small components v x and v y in calculating the force.) 



FIGURE 8.29 Exercise 4. A quadrupole focusing magnet. The hyperbolic curves are 
the boundaries of the pole faces. A charged particle beam moves out of the page. 
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(a) Sketch the B field lines in the .icy plane. 

(b) Explain qualitatively why B produces focusing in the x direction and defocusing 
in the y direction, assuming the beam particles are positively charged. 

(c) Write the equations of motion for a beam particle with charge q and mass m, 
using the approximate force given above. Solve for x as a function of z for z > 0, 
assuming x = .to and v x = 0 at z = 0. Sketch a graph of x(z)- 

8.5. (a) Use a computer program to plot the cycloid curve in the xy plane, given by the 

parametric equations 

x(6) = # — sin#, 
y(6) = 1 — cos#. 

(The cycloid i s the curve traced out b y a point on a circle of radius 1 as the circle 
rolls on the x axis.) 

(b) For a charge q moving in crossed (i.e., orthogonal) E and B fields, starting from 
rest at the origin, plot the kinetic energy as a function of time. 

8.6. Figure 8.30 shows the essential features of an early mass spectrograph of A. Demp¬ 
ster. Singly positive ions enter the vacuum chamber vertically through the slit, after 
having been accelerated through a voltage of 20.0 kV. Their paths are bent by the 
magnetic field B and they are deposited a distance s from the slit on a photographic 
plate. 

(a) If s = 0.250 m for ions of Samarium with mass number 150, i.e., l 50 Sm 62 > what 
isB? 

(b) What is the range of s for the stable isotopes of Sm, whose mass numbers range 
from 144 to 154? 

8.7. Figure 8.31 shows a schematic version of a mass spectrograph of K. T. Bainbridge. It 
is a modification of the design of Fig. 8.30 because it includes a velocity selector in 
the vacuum chamber through which the positive ions pass. The selector uses a hori¬ 
zontal E field and the same B field that bends the path of the ions in the spectrograph 
proper. 

What is the mass of the ions that impinge on the photographic plate a distance s 
from the slit? [Answer: M = eB 2 s/2E] 


vacuum chamber 


B 


// 
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entrance slit 
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FIGURE 8.30 Exercise 6 . The Dempster mass spectrograph. 
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FIGURE 8.31 Exercise 7. The Bainbridge mass spectrograph. 


Sec. 8.3. Biot-Savart Law 

8.8. By pictures showing the directions of the currents, fields, and forces, prove that 
parallel but opposite currents repel each other. 

8.9. Consider two very long parallel wires a distance d = 1 m apart, carrying equal 
currents / in the same direction. 

(a) What is the force per unit length / if / = 1 A? (Notice that if we define = 
An x 10 -7 N/A 2 , then a specified value of / can be used to define the unit of 
current. This is in fact the way the ampere is defined.) 

(b) What must / be to have / = 1 N/m? [Answer: About 2 kA] 

8.10. Helmholtz coils. Two circular coils of radius a, each carrying current I in the same 
sense, are parallel with the xy plane with their centers at (0, 0, ±.v/2). On the z axis 
the magnetic field is B = B(z)k, in the z direction; and at z = 0, i.e., halfway 
between the coils, dB/dz is 0. 

(a) Determine i such that 3 2 B/3z 2 is 0 at z = 0 on the z axis. This configuration 
is called Helmholtz coils, and it produces a very uniform magnetic field in a 
neighborhood of the origin. Show that for this configuration the third derivative 
with respect to z is also 0 at z = 0. 

(b) Use a computer program to plot the magnetic field B(z) as a function of position 
z along the axis of the Helmholtz coils. 

Helmholtz coils are often used in laboratories to cancel out the Earth’s mag¬ 
netic field in a small region of space. However, if a large magnetic-field-free 
region is needed, say room sized, the necessary coils would be impractically 
large, as you can see from your answer to (b). In practice, to produce a field-free 
volume one surrounds it with a material with high magnetic susceptibility,'e.g., 
Mumetal, which concentrates the magnetic field in the walls. 

8.11. Find the magnetic field at the center of a square loop of size 2 a x 2 a carrying cur¬ 
rent I. Repeat the calculation for a regular polygon with n sides, letting the perpen¬ 
dicular distance from the center to any side be a. Show that the result approaches the 
field at the center of a circular loop of radius a in the limit n —> oo. 
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8.12. A square wire loop of size 2 a x 2 a lies in the xy plane with its center at the origin 
and sides parallel to the x and y axes. A counterclockwise current / runs around the 
loop. 

(a) Find the magnetic field on the z axis. [Answer: B z (z) = 2/zo/a 2 /[jr(o 2 + 
z 2 )(2o 2 + z 2 ) 1 / 2 ]] 

(b) Show that for z/a 3> 1 the field becomes that of a magnetic dipole, and find the 
magnetic moment. (See (8.78) and (8.80).) 

(c) Compare the field at the center of this square loop with that at the center of a 
circular loop of diameter 2a. 

8.13. Consider a circular cylindrical solenoid of finite length L, radius a , with N turns 
of wire carrying current Iq. The current may be approximated by a surface current 
density K (= azimuthal current per unit length along the cylinder) equal to NIq/L. 

(a) Calculate the magnetic field on the axis of the cylinder halfway between the 
ends. (Hint: Subdivide the solenoid into infinitesimal current rings dl = Kdz 
and use (8.30).) 

(b) Calculate the magnetic field on the axis of the cylinder at either end. 

(c) Show that B ent j /B center approaches 1 /2 as L/a —> oo. 

8.14. Consider the magnetic field B(x) = axy i + by 2 j . 

(a) What relation must connect the constants a and ti! 

(b) What current density J(x) produces this field? Describe the current distribution 
in words and pictures. 


Sec. 8.4. Ampere’s Law 

8.15. Consider a circular current loop in the xy plane, with radius a, centered at the origin, 
and carrying current I . On the z axis the magnetic field has the form B z (z)k. De¬ 
termine B z (z) and evaluate B z dz. By Ampere’s law the integral must be p-o I. 
Why? 

8.16. Use Ampere’s Law to determine the magnetic field in a densely wound toroidal 
solenoid with inner radius a, outer radius b, and N total turns of wire. The cross 
section of the toroid is a rectangle of width b — a and height h. Show that in the 
limit a, b —>■ oo with b—a fixed, and with the linear density of turns fixed, the result 
agrees with a straight solenoid. 

8.17. Let the current distribution in the slab of Example 5 be J(z) = ( jo\z\/a) i , where jo 
is a constant with dimensions A/m 2 . 

(a) What is B inside the slab, and above and below it? 

(b) Sketch a plot of By (z). 

8.18. Figure 8.32 shows the cross section of a long coaxial cable. The center conductor 
(r < a) carries a current Iq in the direction out of the page. The outer conductor 
(b < r < c) carries the return current /o into the page. The currents are uniformly 
distributed in the conductors. Between the conductors there is an insulating material 
with permittivity eo an d permeability p,Q. 

Find B for all r and sketch a plot o f B versus r. 
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FIGURE 8.32 Exercise 18. Cross section of a coaxial cable. 


8.19. Let B be the magnetic field due to a long thin wire carrying current I . Let the wire be 
parallel to the z axis and passing through the point (x , y) = (a, 0) in the xy plane. 
Explicitly evaluate <£B • dt for a circle of radius R, with R > a, centered at the 
origin. (The answer must be fiQI by Ampere’s Law.) 

(Hint: This calculation is not as easy as it looks! On the circle, di is Rd<fi<fi, but 
the field B is not in the direction <?>. Rather, B is in the direction t, tangent to a circle 
centered at (a, 0). Use Cartesian coordinates to show that 

* « R — a cos <p 

" (R 2 + a 2 — 2oRcos(^>) 1 / 2 


Sec. 8.5. Vector Potential 

8.20. Consider the vector potential A(x) = Ac x x, where c is a constant vector. Does this 
potential satisfy the Coulomb gauge condition? What is the magnetic field? 

8.21. Equation (8.60) is a general formula for the vector potential of a localized static 
current density. Prove that V • A = 0 and — V 2 A = /xoJ. 

8.22. The vector potential A(x) of a spinning sphere (radius a, angular velocity vector to) 
with surface charge density a is given in Example 7. Calculate B(x), for points x 
both inside and outside the sphere, sketch the magnetic field lines, and describe the 
results in words. 


Sec. 8.6. Magnetic Dipole 
8.23. (a) Derive (8.78). 

(b) A small circular loop carrying current I is in the xy plane with its center fixed 
at the origin. The dipole moment is in the same direction as +k. Sketch the 
magnetic field lines. 

(c) A second identical current loop is located on the x axis at some fixed distance 
d from the origin. Assume the axis of the first loop is fixed in the z direction, 
but the second loop is free to rotate. What is the equilibrium orientation of the 
second loop? What is the direction of the second dipole moment in equilibrium? 
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8.24. At the surface of the Earth the magnetic field of the Earth is approximately the same 
as the field from a point dipole mg at the center of the Earth. The dipole moment is 

mg = mg ^ i sin 9q cos <J>q + j sin 9q sin <po + k cos 

where mg = 7.79 x 10 22 Am 2 , and (#o> <Ao)=( 169 degrees, 109degrees). The z 
axis is the Earth’s rotation axis and the x axis passes through the Prime Meridian, 
on which Greenwich lies; positive <j> is to the east. 

(a) Calculate the magnetic field B at a point on the Earth with colatitude 9 and 
longitude <p. Give the components of B to the north (—9), to the east (<p), and 
vertical (?). 

(b) Calculate these components for your home town, in gauss (1 gauss = 10 -4 T). 

8.25. Two identical point-like dipoles are located, respectively, at the origin and atx = zk 
on the z axis. Both dipole moments point in the +z direction. Determine the force on 
the dipole at zk. What happens if the dipoles point in opposite directions? 

8.26. Let the dipole m = mk be at the origin, and call a certain horizontal axis the y axis. 

(a) On the z axis, what is the angle between the z axis and B? 

(b) On the y axis, what is the angle between the z axis and B? 

(c) On the cone 9 = 45 degrees, what is the angle between the z axis and B? 

(d) What is the angle of the cone on which B is horizontal? [Answer; arccosfl/yi)] 

8.27. Figure 8.33 shows three parallel, identical magnetic dipoles. Each has dipole mo¬ 
ment mo j • They are situated at the comers of a 45-90-45 right triangle whose sides 
are s, ~j2s, and s. 

(a) How much work i s required to reverse the direction of m 2 , i f the others are held 
fixed? 

(b) What is the torque on m 2 , in the position shown? 


y 



FIGURE 8.33 Exercise 27. Three magnetic dipoles. 
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(c) What is the torque on mi, in the position shown? [Answers: (a) iiQm^/(2ns 3 )\ 
(b) 0; (c) 3ti 0 ml/(8ns 3 )k] 


General Exercises 

8.28. (a) A long cylindrical conductor of radius Rq, whose axis coincides with the z axis, 

carries a uniformly distributed current /q in the +z direction. Find the magnetic 
field B(x) inside and outside the cylinder. 

(b) A cylindrical hole is now drilled out of the conductor, parallel to the axis, so that 
the cross section is as shown in Fig. 8.34. The center of the hole is at x = a, and 
the radius is b. The conductor carries the same current /q as in (a). Determine 
the magnetic field in the hole. You will find that the magnetic field in the hole is 
uniform. 

8.29. The magnetron is a vacuum-tube device that is used to generate ultra-high frequency 
currents in microwave sources, like microwave ovens or radar transmitters. The fre¬ 
quency range is 10 9 Hz to 10* 1 Hz. A schematic design for a magnetron is shown in 
Fig. 8.35. An electron bunch circulates in a constant magnetic field B, passing elec¬ 
trodes at opposite ends of a diameter of the orbit. The potential V at either electrode 
oscillates with the distance from the electron bunch. 

(a) Determine the frequency of the alternating potential. 

(b) Determine B for a microwave frequency of 10 10 Hz. 

8.30. A beam of hydrogen isotopes enters a mass spectrometer. The protons and deuterons 
have been accelerated from rest by a potential drop Vq. The radius of the proton orbit 
is 10 cm. Calculate the radius of the deuteron orbit. 

8.31. A thin disk of radius R carries a surface charge o. It rotates with angular frequency 
co about the z axis, which is perpendicular to the disk and through its center. What 
is B along the z axis? What is the magnetic moment of the spinning disk? [Answer: 
m z = ncrcoR 4 /4] 


y 



FIGURE 8.34 Exercise 28. Cylinder with a cylindrical cavity. 
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8.32. A charged annular disk with inner radius a and outer radius b, and surface charge 
density a, lies in the xy plane with its center at the origin. It rotates about the z axis 
with angular velocity u>. 

(a) What is the magnetic moment m? 

(b) What is B in the xy plane at distance p from the origin, where p 3> a,b1 

(c) Find B z ( 0, 0, z) for all values of z. 

8.33. A circular coil (radius R and mass M) has N turns of wire carrying current Iq. 
The coil is free to rotate about the z axis, which lies in the plane of the coil and 
passes through its center, as shown in Fig. 8.36. There is a constant magnetic field 
B = Bo i • Initially the coil is in stable equilibrium with its magnetic moment parallel 
to the field. 

(a) What is the frequency for small angle oscillations in 0 about the equilibrium 
position? 

(b) Now let M = 0.10 kg, N = 100 turns, Iq = 0.1 A, and Bq = 0.05 T. Evaluate 
the frequency of small angle oscillations. 



FIGURE 8.36 Exercise 33. Coil in a magnetic field. 
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(c) If the coil is released from rest at the small angle $o = 15 degrees, what is its 
angular velocity as it swings through the equilibrium position? 

8.34. Show that the magnetic field on the axis of a finite solenoid of radius a centered on 
the z axis, extending from z = —1/2 to z = 1/ 2, and with n turns per unit length of 
wire carrying current /, is 


B(0, 0, z) 


Von I 


e/2-z 


+ 


1/2 +z 


y/a 2 + {1/2 - z) 2 yja 2 4- (f/2 + z) 2 


Computer Exercises 

8.35. The equations of motion of a charge q in a magnetic field Zi()k are 

dx/dt = v x , dy/dt = tt v 

dv x /dt = coVy, dvy/dt = —cov x 

where co = qBq/m. Solve the equations numerically on a computer. Set co = 1 and 
take initial values (xq, yo> v 0x, v 0y) = (1. 0, 0. 1). 

You might integrate the equations stepwise for a small time step. Or, more simply, 
use an analytic computer program with a built-in differential equation solver. 

Plot the trajectory, i.e., (x(t), y(t)) as a function of t. It should be a circle. 

8.36. The equations of motion of a charge q in crossed electric and magnetic fields Eqj 
and Bqk for motion with v z = 0 are 

dx/dt = v x , dy/dt = tt v 

dv x /dt = &>i> v , dvy/dt = a — cov x 

where co = qB§/m and a = qE^/m. Solve the equations numerically on a com¬ 
puter. For illustration purposes choose units with co = 1 and a = 1. If the particle 
starts at rest at the origin the trajectory is a cycloid. Explore what happens for differ¬ 
ent initial values of vo*. both positive and negative, keeping vq v = 0. Explain why 
if vq x = 1 (in these units) the particle moves on a straight line. 

8.37. A familiar type of problem in magnetostatics is to determine the magnetic field fora 
specified current distribution. Here we consider the inverse problem: Given the field 
B(x), what current density J(x) would produce that field? 

(a) Suppose the vector potential is 

A(x) = ^e~ r2 '“ 2 4>. 

r 

(We specify A because that guarantees that V • B = 0.) r is the cylindrical radial 
coordinate, i.e., the perpendicular distance from the z axis to x. Determine B(x) 
and J(x). Describe in words and pictures the field and current. Use an analytical 
and graphical computer program to carry out the rather intricate calculations, 
and to plot the results. 

(b) Suppose instead the vector potential is 

A(x) = Aoe~ r ~/“ 2 k., 
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where again r is the cylindrical radial coordinate. Analyze B(x) and J(x) as 
in (a). 

8.38. Themagnetic field of a magnetic dipole attheorigin is given in (8.78). Let the dipole 
moment point in the z direction. Use computer graphics to make a figure showing 
the vector field: At each point of a dense grid in the xz plane, have the computer 
draw an arrow in the direction of B. 

8.39. The vector potential for a pointlike magnetic dipole uiqU. at the origin is (8.77). It is 
infinite at r = 0. Consider now the nonsingular potential 

, W«0 k x ? p 0 «o sinf? - 

A(x) = —-r--=■ = —-;- r~<p, 

4tt r 2 + a 2 4jt r 2 + a 2 

which approaches (8.77) for r S> a. Determine the current density J(x) that would 
create this vector potential. Use computer graphics to make a contour plot of J,i, in 
the yz plane. 

8.40. Use a computer program to plot the magnetic field of a ring of current, on the axis 
of the ring, as a function of distance from the center of the ring. 

8.41. Consider the current loop of Sec. 8.7—a circle in the xy plane centered at the origin 
and with radius R. For points on the xy plane the magnetic field has the form B = 
B~(/-)k, where r — -Jx 2 + y 2 is the distance from the origin. By computing B z (r) 
numerically, plot a graph of B z (r) as a function of r. (Use R as the unit of length 
and po 1 /R as the unit of magnetic field.) Be careful: The field has a singularity at 
the wire. On the same graph show the field of a pointlike dipole with the same dipole 
moment m = tzR 2 I. 



CHAPTER 



Magnetic Fields and Matter 


The most general problem in magnetostatics is to calculate the magnetic field 
everywhere in a system of specified current loops and magnetic materials. We 
saw in Chapter 8 how the Biot-Savart law determines the magnetic field produced 
by a steady current, but we assumed that any matter that might affect the field was 
far away and negligible. In other words, we only described a magnetic field in a 
vacuum. 

But the presence of matter—solids, liquids, or gases—changes the magnetic 
field from what it would be in vacuum. In this sense all materials are magnetic 
materials. They change the field because they contain microscopic, atomic-scale 
currents and magnets which are themselves sources of B. In this chapter we study 
why the field changes and how to calculate the new field. We shall acquire a 
qualitative and semiquantitative understanding of magnetic interactions in matter, 
leant how and why magnetic properties vary among materials, and look at some 
applications. 

Most magnetic materials— diamagnetic or paramagnetic materials—interact 
with externally applied fields in the manner just described. These interactions 
are usually weak, so although such materials have an important role in physics 
for understanding magnetism, their role is not familiar from everyday use. But 
some magnetic materials— ferromagnetic materials—interact strongly with exter¬ 
nal fields. Furthermore, ferromagnetic materials can be permanent magnets ; that 
is, they can be sources of strong magnetic fields by themselves. Some applications 
of ferromagnets are large permanent magnets used industrially, small refrigerator 
magnets or those in toys, large solenoids in magnetic resonance imaging and other 
medical applications, and small solenoids in relay switches on cars and electrical 
devices. 


9.1 ■ THE ATOM AS A MAGNETIC DIPOLE 

We learned in Chapter 6 that when an electric field E is applied to an isotropic 
insulator the material becomes electrically polarized, i.e., it develops an electric 
dipole moment per unit volume P(x), which is called the polarization, parallel 
to E. Similarly, when a magnetic field B is applied to a magnetic material the 
material becomes magnetically polarized, i.e., it develops a magnetic dipole mo¬ 
ment per unit volume M(x), which is called the magnetization. Magnetization is 
a density—the density of magnetic moment. The dimensions of M are A m 2 /m 3 
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or A/m. The phenomena associated with magnetic polarization are rather more 
complex than those of electric polarization. For example, in some materials M 
points in the direction of the applied field B, in which case the material is called 
paramagnetic; in others M points opposite to the direction of B, in which case 
the material is called diamagnetic. In ferromagnetic materials, whose considera¬ 
tion we leave for Section 9.6, M is parallel to B and very large. It is important to 
realize that M(x) may vary with position x in the material. 

Electrons are the microscopic origin of the macroscopic magnetic properties 
of matter. Every electron is a small magnet, because it has a magnetic moment 
m sp in associated with its spin. The spin is the electron’s intrinsic angular momen¬ 
tum. For electrons, the direction of m sp i n is opposite to the spin direction, and the 
magnitude, called the Bohr magneton, is 

pi B = — = 9.27 x 1(T 24 A m 2 (9.1) 

2m e 

where m e is the electron mass. The sum of electron spin moments in the neigh¬ 
borhood of x is one contribution to M(x). 

Another electronic contribution to M(x) comes from the atomic orbital mo¬ 
tion of electrons. Quantum mechanics is necessary for an accurate theory of the 
electronic structure of matter, but we can get some understanding of magnetic ma¬ 
terials from simpler semiclassical models. An electron (charge — e) moving with 
velocity v in a circular orbit of radius r is equivalent to a current I = ev/(2nr) 
circulating around a loop of area A = nr 2 . Figure 9.1 is a schematic picture of 
an electron revolving around a nucleus whose charge is Ze. The orbital magnetic 
moment is w orbital = IA = evr/2. Thus w orbital is directly related to the orbital 
angular momentum L = m e vr, by 


m orbital — ,, • (9.2) 

2 m e 

(The direction of m is opposite to L because the electron is negative.) The angular 
momentum and magnetic moment are both quantized; L z must be m(h, where 



FIGURE 9.1 Classical electron orbit. In a classical model the electron revolves around 
the nucleus, creating a magnetic moment. 
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mi = 0, ±1, ±2,..., ±£ is the angular momentum quantum number. It follows 
that 


k ‘ Ulorbital — M B ' ■ 

so Worbitai is also of order n is. The sum of orbital moments in a neighborhood of 
x contributes to M(x). 

If no magnetic field is present, then usually the m’s of different atoms point in 
random directions, and so M(x) = 0. However, if there is a net dipole moment 
<5m in a small volume 8 V at x then M(x) = <5m/<5V\ This is the usual averaging 
procedure: We obtain the macroscopic M(x) by averaging over small volumes 
that yet contain many atoms. 

Table 9.1 lists some magnetic moments of fundamental interest, and it is clear 
from the table that the range of magnetic moments is very large. (There are 
astronomical objects—pulsars—with much larger magnetic moments than the 
Earth’s.) From Table 9.1 we see that the magnetic moments of the proton and 
neutron are much smaller than that of the electron; the proton moment is of order 
eh/2m p , smaller than /x b by the ratio m e /m p ~ 0.5 x 10~ 3 . Therefore the nu¬ 
clear particles do not contribute significantly to the macroscopic magnetic proper¬ 
ties of matter. However, the proton magnetic moment and the magnetic moments 
of heavier nuclei are the subjects of important research and applications in nu¬ 
clear magnetic resonance (NMR) and magnetic resonance imaging (MRI). In this 
context it is interesting to note the importance of the magnetic nuclei Fluorine- 
19 and Phosphorus-31, which, because they are the only stable isotopes of these 
elements, find wide use in magnetic resonance studies. 

It might seem surprising that the neutron, a neutral particle, has a magnetic 
moment, but if we recall that the neutron is a bound state of quarks (which are 


TABLE 9.1 Some magnetic dipole moments 


Magnetic system 

Symbol 

Value in A-m 2a 

Electron spin 

Proton spin 

Neutron spin 

Typical compass needle 

Typical superconducting current loop 

Earth 

Me = “MB 

Mp 

Mu 

-9.27 x 10~ 24 
1.41 x 10~ 26 
-0.966 x 10~ 266 
10 -2c 

20 c 

7.79 x I0 22d 


“The unit A-m 2 is equivalent to J/T. The Bohr magneton fig is 9.27 x 10 24 J/T, or5.79x 10 5 eV/T. 

*The minus sign means that the direction of the neutron’s magnetic moment is opposite that of its 
angular momentum. 


C D. R. Stump and G. L. Pollack, Am. J. Phys. 65, 81 (1997). 

‘TGRF model for 2000. The Earth’s magnetic moment is directed approximately opposite to its angu¬ 
lar momentum. 
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charged spin-1/2 particles) it is not surprising at all. Neutron beams are used to 
probe the microscopic magnetic structure of matter, for example, the magnetic 
structure of superconductors. This technique exploits the interaction between the 
neutron’s magnetic moment, small though it is, and the atomic magnetic moments 
in the superconductors. 

9.1.1 ■ Diamagnetism 

Many everyday materials, e.g., water, wood, glass, and polyethylene, as well as 
many elements, e.g., H 2 , N 2 , Ar, Cu, Ag, Pb, are diamagnetic. When a diamag¬ 
netic material is placed in an external B field, it becomes magnetized: It acquires 
a magnetization M proportional to |B|, but M points in the direction opposite 
to the direction of B. If the field is removed the magnetization of a diamagnetic 
substance vanishes. Because M is opposite to B, a diamagnetic material will be 
repelled, although weakly, by a bar magnet; like poles repel. Diamagnetism is 
essentially independent of temperature. 

Diamagnetism was first observed by Brugmans in 1778, in Bi and Sb. A needle 
of Bi, or any diamagnetic material, suspended on a pivot in a magnetic field, has 
its equilibrium position perpendicular to the field. In contrast, when a needle of a 
paramagnetic material is thus suspended it comes to equilibrium aligned parallel 
to the field; this is like the familiar behavior of a compass needle in Earth’s mag¬ 
netic field. It is natural to ask: Why does a diamagnetic needle align cross-wise 
to the field? This question requires some consideration. The principle is that in a 
diamagnetic material the atomic dipoles, which are induced by B, always point 
opposite to B, whatever the orientation of the needle, so the lowest energy state 
is the perpendicular orientation. Figure 9.2 shows the equilibrium positions of 
diamagnetic and paramagnetic needles in a magnetic field. 



f + ♦ T1 

r N 

diamagnetic 

needle 


_N 

t 

\ 
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paramagnetic 
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FIGURE 9.2 A needle in a magnetic field. Equilibrium for a diamagnetic needle is to 
be perpendicular to B; or, for a paramagnetic needle to be parallel with B. 
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The names “diamagnetism” and “paramagnetism” were invented by Faraday, 
who studied these phenomena extensively beginning in 1845. The names are 
based on the criteria that equilibrium is parallel to B for a paramagnetic nee¬ 
dle and perpendicular (“diametrical”) for a diamagnetic needle. In Faraday’s time 
there was no knowledge of atomic structure (nor even of the existence of atoms!); 
the fact that some materials align parallel to B and others perpendicular must have 
seemed a real puzzle. 

Diamagnetism is a rather anti-intuitive phenomenon because one would ex¬ 
pect, from N = m x B, that the torque on microscopic dipoles in the material 
would tend to line them up parallel to the applied field. But this model is too 
simplistic, because diamagnetism, and indeed every magnetic phenomenon, is re¬ 
ally a quantum mechanical phenomenon. It is natural to use quantum theory to 
explain magnetism, because the magnetic properties of matter originate in the 
electronic structure of the atom—a quantum system. Unfortunately a full quan¬ 
tum discussion is rather intricate, and beyond the scope of this book. So instead 
it is customary 1 to give a classical demonstration of the plausibility of diamag¬ 
netism and we’ll study that now. This explanation, proposed by J. J. Larmor in the 
early 1900’s, is based on Faraday’s Law of electromagnetic induction. 

The classical picture of diamagnetism is based on a planetary model of the 
atom, reminiscent of the Bohr model but without quantization. An atomic electron 
is assumed to revolve around the nucleus on a circular orbit. For simplicity we’ll 
imagine a single electron. This model is only heuristic, and of course differs from 
the real hydrogen atom, which has zero angular momentum in the ground state. 

Figure 9.3 represents the model atom in a magnetic field. The electron revolves 
counterclockwise around the proton at radius r. The magnetic field B = Bk is 


y 



FIGURE 9.3 Induced dipole moment. In a classical model, applying a magnetic field 
changes the speed of the electron, and so the magnetic moment. 

1 The Feynman Lectures, Vol. II, Chap. 34. 
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perpendicular to the electron orbit, out of the page. When no magnetic field is 
present we can write the equation of motion e 2 /(Aneo r 2 ) = m e v 2 /r, calling v the 
electron speed; i.e., the Coulomb attraction is the centripetal force on the electron. 
However, when B is present there is an additional inward force on the electron, 
due to the Lorentz force, as shown in Fig. 9.3. If the electron orbit remains the 
same while the magnetic field is turned on, i.e., r is the same with or without B, 
then the electron must speed up as B increases, reaching a new speed v' when 
the field strength is B. Adding the magnetic force the equation of motion then 
becomes 


e 2 . v' 2 

- - T+evB=m e —. (9.3) 

47i6or l r 

We defer until Chapter 10 the proof that r remains constant when B is applied, 
because that proof relies on Faraday’s Law. 

Now, if m e v 2 /r is substituted for e 2 /(4n eor 2 ) in (9.3) then, after rearranging, 
(9.3) becomes 


TYl p .'■y -f TTT Pi / / 

— (i/ 2 - u 2 ) = — (v' + i>)(u -v) = ev'B. (9.4) 

r r 

For actual atomic sizes and realistic fields the velocity increase ( v' ~ v) — Sv 
is very small compared to v. We may therefore say ( v' + u) ~ 2v' so that (9.4) 
implies Sv — eBr/2m e . Then from the argument leading to (9.2) the change in 
ntorbitai associated with Fig. 9.3 is 


er - e r * 

,5m =-(Su)k =-flk. (9.5) 

2 4 m e 

The minus sign in (9.5) tells us that the change in the magnetic moment of this 
classical atom, brought about by the application of B, is opposite to the direction 
of B. Figure 9.3 shows counterclockwise motion of the electron, but we would 
get the same result if we considered clockwise motion, because in that case the 
additional force on the electron is radially outward. Then the electron slows down 
as B is applied, so that again the change in the magnetic moment is in the direction 
opposite to the applied field. If the net moment in a neighborhood of x is 0 for B = 
0, it will be opposite to B after B is applied. The material is therefore diamagnetic 
according to this classical picture, as claimed. 

We have seen how the direction of m ends up opposite to B. The final result 
(9.5) is even reasonable quantitatively. 2 However, the crucial step in the argument 
was passed over lightly: The orbital radius r remains constant while B is changing. 
That follows from Faraday’s Law of electromagnetic induction; we’ll see that in 
Sec. 10.3. In qualitative terms the changing B induces a current opposing the 
change of B, by Lenz’s law, i.e., an induced moment opposite to B. 

^See Exercise 5. 
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The result (9.5), although obtained from classical physics, is the same as the 
quantum mechanical result if r 2 is properly interpreted. We shouldn’t read too 
much into that; some similarity in the quantum and classical results is inevitable 
by dimensional analysis and the correspondence principle. Roughly, r 2 is the 
mean square radius of the electron quantum state. The quantum mechanical expla¬ 
nation for diamagnetism is based on the momentum term in the electron Hamilto¬ 
nian. If there is no magnetic field then the momentum term is the familiar p 2 /2 m e , 
where p is the momentum. If there is a magnetic field then this term becomes 
(p — e\) 2 /2m e , where A is the vector potential. It is the small term e 2 \ 2 /2m e 
that explains diamagnetism. 

The principles of quantum mechanics apply, of course, to all materials so all 
materials have a diamagnetic response. However, in paramagnetic and ferromag¬ 
netic materials the diamagnetism is masked by larger effects. Materials that have 
complete electron shells, and therefore no net orbital or spin angular momentum 
in zero field, have only diamagnetic behavior. Examples of such purely diamag¬ 
netic materials are the inert gases, He, Ne, Ar, Kr, Xe, and Rn. Some diamagnetic 
ions are singly positive ions of alkali metals, Li + , Na + , K + , Rb + , Cs + , and singly 
negative halide ions, F - , Cl - , Br - , I - , and At - . We will see otherexamples later. 

We have considered just the response of one electron. The response of the 
whole atom will be the sum of the effects of all the electrons. Because diamag¬ 
netism depends on the internal dynamics of the atom in its ground state, it does 
not depend on the temperature. 

9.1.2 ■ Paramagnetism 

When a paramagnetic substance is placed in an external B field it becomes magne¬ 
tized with M proportional to B and in the same direction as B. The magnetization 
vanishes when the field is removed. Because M is parallel to B, a paramagnetic 
material will be drawn into a magnetic field; e.g., it will be attracted by a bar 
magnet. 

The cause of paramagnetism is that the atom or molecule of the material has a 
permanent dipole moment. In other words, the electronic structure of the ground 
state of the atom has a nonzero m, even if B = 0. In the presence of B these 
permanent moments align partially with the field, because of the magnetic torque, 
and make up the magnetization. Not all materials have a permanent atomic or 
molecular moment; those with m = 0 are diamagnetic. 

The typical order of magnitude of a permanent atomic dipole moment is p.B, 
coming from unpaired electron spins or orbital angular momenta in the atom. 
This is large enough that the paramagnetic response dominates the diamagnetic 
response. Any atom with an odd number of electrons will have a magnetic mo¬ 
ment and will therefore be paramagnetic. Some atoms with an even number of 
electrons have a permanent magnetic moment because inner electron shells are 
unfilled; examples of such atoms are Cr, Fe, Ni, Pd, and Pt. Also, almost all the 
rare earth elements have permanent magnetic moments because they have elec¬ 
trons in an unfilled 4f-shell. The 4f states have large orbital angular momentum 
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(l = 3) and degeneracy (21 + 1 — 7). Some rare earth elements have very large 
atomic moments because they have many electrons with their spins aligned, and 
also with large orbital angular momentum. Molecules in most cases are not para¬ 
magnetic because the electron spins are paired. However, some molecules, for 
example NO which has 15 electrons, are paramagnetic. A particularly interesting 
and important case is O 2 which, although it has 16 electrons, has two unpaired 
electrons and is therefore strongly paramagnetic. 

The preferential alignment of the permanent moments parallel to B is favored 
by the interaction energy U = —m • B, which is minimum if m and B point in the 
same direction; but it is opposed by thermal fluctuations, which are characterized 
by the energy kT at temperature T. The angle 8 between m of a particular atom 
and B fluctuates between 0 and n ; but it is a little more likely to have 8 < n/2 
than 8 > n/2, so the mean orientation is parallel to B. 

In classical statistical mechanics the thermal average of cos# at high tempera¬ 
tures is 


mB 

(cose) = 


As T —*■ 00 the direction of m becomes random, equally likely to have 8 < n /2 
or 8 > n/2, so (cos 8 ) -* 0. Paramagnetism is a small effect at room temperature 
because mB kT. For example, suppose m = 113 , a typical value, and B = 1 T 
and T = 300 K; then (cos 8 ) = 7.5 x 10 -4 , which corresponds to an angle of 
89.96 degrees, an alignment of only 0.04 degrees in the direction of B. 

If the atomic density is n and each atom has a magnetic moment with mag¬ 
nitude mo then the magnetization in the presence of B, neglecting interactions 
between the atoms, is 


M = «mo(cos#)k = 


3kT 


B. 


(9.6) 


The result shows that the paramagnetic response varies with temperature as 1/ T. 
The linear relationship between M and B holds down to low temperatures and 
high fields. At very low T and high |B|, the atomic dipoles approach a state of 
complete alignment with the field. In this limit M approaches a limiting, or satu¬ 
ration, value, and thus remains constant if |B| increases still further. 


9.2 ■ MAGNETIZATION AND BOUND CURRENTS 

If a magnetic body has a magnetization M(x') as a function of position x' in the 
material, what magnetic field B(x) does it produce? We’ll restate this question 
by asking for the vector potential A(x), from which B(x) can be calculated by 
B = V x A. The magnetization M(x') can be either permanent magnetization 
in the absence of an applied field, as in ferromagnets, or the result of magnetic 
polarization by an applied field, as in diamagnets or paramagnets. In the latter case 
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FIGURE 9.4 A magnetic object, dm is the dipole moment of a small volume d^x' at 
xThe field at P is the integral of fields produced by elemental moments. 


the resultant field is the superposition of the applied field and the field produced 
by the magnetized material. 

Because individual electrons are the ultimate source of M, the magnetic mo¬ 
ment density fluctuates widely on a microscopic scale depending on how many 
electrons or magnetic atoms are included in a microscopic volume. For our macro¬ 
scopic purposes we can ignore this fine-scale variation by considering volumes 
d?x' that contain many atoms but are small on the scale of the macroscopic sys¬ 
tem. On the macroscopic scale M(x') is a smooth function and we can use it in 
integrals. 

Figure 9.4 shows such a magnetic object. The magnetic moment of the volume 
element d^x' is dm = M(x')^ 3 .r', density times volume. We seek A(x), then, 
at the point P. Recall Eq. (8.71), which gives A(x) for a point-like magnetic mo¬ 
ment m, apply it to the elemental moments dm, and integrate over the volume V 
of the body; the result is 


_ do f d m x (x - x') _ do f M(x') x (x - xQ 3 , 
4tt Jv |x — x'| 3 4 n Jv |x-x'| 3 


(9.7) 


There are many problems in which M(x') is given and the field is sought, but 
(9.7) is seldom used, as written, to solve such problems. One reason is that the 
integrand can be unwieldy. But a more important reason is that (9.7) can be recast 
into a form that is easier to use. 

First note that the integrand in (9.7) may be written as 


M(x') x (x - x') , / , 1 \ 

integrand = ^ = M(x') X (v'_) 


(9.8) 


where V' is the gradient with respect to x'. Now integrate by parts. That is, swing 
the del operator over to the other term in the integrand by writing 


integrand = 


V' x M(x') 
Ix-x'l 


- V' X 



(9.9) 
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(The latter result is an application of the vector identity 


Vx (/F) = (V/)xF+/VxF, 


applied to the variable x' with F = M(x') and / = |x — x'| _1 .) The volume 
integral of the second term in (9.9), which is a total derivative, may be converted 
to a surface integral by the identity 3 


[ V' x Gd 3 x' = / n x G dA’\ 
Jv Js 


(9-10) 


S is the boundary surface of V, and n is the outward normal at dA'. Combining 
these results, we write the vector potential as 


A(x) 


_ Mo f 

An Jv 


V'x M(x') 3 , ix o f M(x') x ndA' 

|x-x'| X+ 4n Js |x-x'| 


(9.11) 


Equation (9.11) should look familiar. It has the same form as equations (8.54) 
and (8.55) for the vector potential of specified current distributions, volume and 
surface respectively. Comparing the first term on the right of (9.11) with (8.54) 
shows that V' x M(x') occupies the place of a volume current density; we’ll call 
it Jb(x'), the density of bound current at x' in the magnetized body. Likewise, 
the second term on the right of (9.11) may be compared with (8.55), showing 
that M(x') x n may be considered to be a surface current density; we’ll call that 
Kb(x'), the density of bound current at x' on the surface of the magnetized body. 
Equation (9.11) thus tells us something interesting about magnetism and gives us 
an important tool for calculating magnetic fields: The magnetic field produced 
by a magnetized object is the same as that of a volume current Jb and a surface 
current Kb, where 



Jb(x') = V' x M(x') 

(9.12) 

and 

Kb(x') = M(x') x n. 

(9.13) 

To calculate the field we may replace the actual magnetization M(x') in the object 
by the effective currents Jb(x') and Kb(x'). Finally, to express this explicitly, we 
can write (9.11) as 

A(x) = 

W f uV + M ( Kb<X ' ) dA'. 

An J v |x - x'| An J s |x - x'| 

(9.14) 


9.2.1 ■ Examples 

We will now apply this approach to two examples. The examples are relatively 
simple—limited to highly symmetric bodies in which M is constant. Even these 

3 This integration identity may be derived by applying Gauss's divergence theorem to C x G, where 
C is an arbitrary constant vector, and using the identity n ■ (C x G) = —C • (n x G). 
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problems are challenging. Clearly for more general problems—consider for ex¬ 
ample a geological ore with inclusions of variously magnetized material mixed 
with nonmagnetic substances—the integrals over volume and surface in (9.14) 
would be very difficult. 


EXAMPLE 1 What is the field of a uniformly magnetized sphere? 

Let the magnetization in the sphere be M = M k and its radius be a, as shown 
in Fig. 9.5. Because M is constant there is no volume current density: Jb = V x 
M = 0. There is, however, a surface current density Kb = M x ? = M sin0^>. 

It will not be necessary to redo the integral over Kb(x') in (9.14) because we 
have evaluated an integral of the same form in Example 7 of Chapter 8 (the rotat¬ 
ing charged sphere). Both of these examples have a surface current proportional 
to sin ft on a sphere. Notice the similarity between Figs. 9.5 and 8.17; in partic¬ 
ular Kb in Fig. 9.5 is parallel to v of Fig. 8.17. If we replace M by aacoV., then 
the field of this example is the same as that of a sphere of radius a with surface 
charge density a rotating with oo = <wk. 

Carrying over the result of Example 7 of Chapter 8, and replacing aaco by M, 
we obtain the vector potential of the magnetized sphere. In spherical coordinates 
the result is 


* Uo ivi 

A(r, 6 ) — A^ir, 9)(f> =—-—rsinft</> for r < a, (9.15) 

Mr, ft) = A${r, 0)(j> = s i n 6><£ for r > a. (9.16) 

3 r l 




FIGURE 9.5 A magnetized sphere. The magnetization is uniform inside the sphere. 
There is a corresponding bound surface current density Kb on the sphere. 


318 


Chapter 9 Magnetic Fields and Matter 


The corresponding magnetic field V x A is 

B(r, 6 ) = ^/xoM = -MoM cos0 — 0sin0^ for r < a, (9.17) 

B(r, 6 ) = cos0 + 0sin0) for r > a. (9.18) 

3 r i 

In writing the second equality of (9.17) we used the unit vector identity k = 
? cos0 — 0sin0. 

In magnetostatics the field B satisfies certain boundary conditions. We will 
develop these conditions in detail later 4 but it is worthwhile to anticipate that 
development by noting the behavior of B at the boundary of the sphere in this 
example. From (9.17) and (9.18) at r = a we have: 

(i) The normal component of B is continuous, 

#int ,r(a, 9) = B ixUr (a, 9) = |/r o Mcos0; 

(ii) The tangential components of B are discontinuous and the discontinuity is 
related to the surface current density, 


Bext,e(a,9) - B- mue (a,0) = /roMsintf = fi 0 K b , 0 . 


Equation (9.17) shows that inside the sphere B is uniform and parallel to M. 
Equation (9.18) shows that outside the sphere B is the same as the field of a point 
magnetic dipole at the origin (see (8.73)) with dipole moment m = ( 4 / 3 ) 7 ra 3 M. 
We know that the far field (r a) must have this form because M is the dipole 
moment per unit volume and ( 4 / 3 ) 7 ra 3 is the volume of the sphere. But it is 
somewhat surprising here, as it was in Example 7 of Chapter 8 , that the external 
field is a pure dipole field. It is pleasing, but maybe now not so surprising, to 
know that there is a theorem generalizing this result . 5 The theorem states that if a 
sphere has any spherically symmetric multipole density, i.e., a multipole moment 
per unit volume that depends only on r, then the whole external field is the same 
as if the total multipole moment were at the origin. 


EXAMPLE 2 What is the magnetic field on the axis of a uniformly magnetized 
cylinder? A physical realization of this example could be a cylindrical bar magnet. 

Let the magnetization of the cylinder be M = M k, its radius a, and its height 
h, as shown in Fig. 9.6. Again, as in Example 1, there is no volume current be¬ 
cause Jb = V x M = 0 for uniform magnetization. As for surface current, there 
is none on the flat surfaces, z = 0 and z = h, because M is, respectively, antipar- 

4 See Section 9.3.4. 

5 M. J. Harrison andR. D. Spence, Am. J. Phys. 62, 828 (1994). 
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FIGURE 9.6 A magnetized cylinder. The magnetization is uniform inside the cylinder. 
There is a corresponding bound surface current density Kb on the cylinder. 


allel and parallel to n there. There is bound surface current on the curved surface 
r = a, because there Kb = M x ? = M<f>. Thus for purposes of calculating 
B(x), a uniformly magnetized finite cylinder is equivalent to a stack of elemental 
rings each with radius a and current dl = Mdz'. Note too that this problem is 
equivalent to finding B for a finite solenoid. 

To calculate the magnetic field we first note, from symmetry, that on the z axis 
B = B z (z)k. Using (8.26) for the field on the axis of a current ring and integrating 
elemental rings over the length of the cylinder yields 


D , N _ doMa 2 f h dz! 

° AZ) ~ 2 Jo W + (z - z') 2 ] 3 / 2 

_ /UqM / h — z z 

2 y yj a 1 + (h — z) 2 a 1 + z 2 


(9.19) 


which answers the question we posed. We leave it as an exercise 6 to plot B z (z). 

In Example 1 we found B(x) everywhere in space for the magnetized sphere, 
but in Example 2 we limited ourselves to finding B on the cylinder axis. Can we 
find B(x) everywhere in space for the magnetized finite cylinder of Fig. 9.6? How 
would we proceed? 

The answer is that we can find B(x) everywhere for the cylinder but it is analyt¬ 
ically difficult. One approach is to obtain A(x) for the finite cylinder by integrating 
dA(x) for elemental current loops making up the bound surface current. (We know 


6 See Exercise 2. 
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the vector potential of a ring from Sec. 8.7.) In practice it would be necessary to 
use numerical techniques to do the calculation. It seems somewhat perverse that 
a magnetostatics problem so easily stated and with so much symmetry as this one 
should be analytically so difficult. After all, cylindrical bar magnets and solenoids 
are in many everyday devices. We leam from this that, in electromagnetism, even 
easy problems can be hard. 

On the other hand, it is straightforward to find the field of the magnetized 
cylinder far away. For r y>> a and h, B approaches a dipole field (8.73) with 
dipole moment m = na 2 hMk, the total dipole moment of the cylinder. On the 
far z axis, i.e., z » a and h , we have B z (z) = noMa 2 h/2z?\ on the far z = 0 
plane, i.e., r » a and h, we have B z (r) = ~iXQMa 2 h/Ar 2 . Another case for 
which we can easily write the answer is an infinitely long cylinder. That case is 
equivalent to an infinite solenoid with current per unit length M <f>, so B ou tside = 0 
and Binside = IA)Mk. 


9.2.2 ■ A Geometric Derivation of the Bound Currents 

The equations (9.12) and (9.13) for bound current densities Jb and Kb are impor¬ 
tant to our understanding of magnetism in matter. Their derivation has been rig¬ 
orous, but mathematical and not intuitive. It is worthwhile to derive the equations 
again, by geometric arguments, to better appreciate their meaning. In particular, 
we shall see that V x M is indeed a current density. 

Imagine a magnetized body subdivided into small rectangular cells. Each cell 
contains many atoms, although it is small on the overall scale of the body. For 
simplicity we’ll assume the magnetization is everywhere parallel to the z axis 
M = Mk but the magnitude may vary. M is the magnetic dipole moment density, 
so the total dipole moment of a cell of size Sx x Sy x Sz is MkSxSySz. We will 
attribute this total moment—the sum of the moments of many atoms—to a surface 
current K on the four faces of the cell parallel to the z axis, as shown in Fig. 9.7(a). 
The dipole moment due to the surface current is k I A, where I is the integrated 
current density / = KSz and A is the area A = SxSy, comparing with the formula 
for M, 


MSxSySz = (KSz) (<5x<5y), 

so the effective surface current on the faces has K = M. The direction of K on 
any of the four boundary surfaces of the cell is tangent to the surface, and we see 
from Fig. 9.7(a) that K = M x n, where n is the outward normal of that surface. 

If M is constant throughout the sample, then the surface currents on cell-faces 
inside the sample cancel; each interior face is part of the boundary of two neigh¬ 
boring cells, with surface currents in opposite directions. In this case the only 
bound current is Kb = M x n on the exterior surface of the sample. 

If M(x) varies with position in the material, then the internal bound currents 
do not cancel, so there is a volume current density Jb(x). Again for simplicity 
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FIGURE 9.7 Geometric derivation of bound current, (a) A small rectangular cell 
with magnetization M out of the page and equivalent surface current K counterclockwise, 
(b) Two neighboring cells; the magnetization in the upper cell is greater than that in the 
lower cell, so there is an effective current to the right at the interface. 


taking M = M k, consider neighboring cells centered at y + Sy/2 and y — 8y /2, 
as shown in Fig. 9.7(b). The net current in the x direction through the area SySz 
between the cell centers (indicated by a dashed line) is 

J bx Sy8z = M z (y + 8y/2)8z - M z (y - 8y/2)8z, 

because K x = ±Af- is the surface current density at either side of the interface 
between the cells. In this expression, M z (y ± 8y/2) means M z evaluated at y ± 
8y/2. Now, treating 8 y as infinitesimal, J\, x — dM z /dy. Similarly, by considering 
neighboring cells centered at x + Sx/2 and x — 8x/2, there is a volume current 
density in the y direction of J\, y = —dM z /dx. These results are precisely the x 
and y components of the vector equation Jb = V x M for the case when M = M k. 

In a magnetized object there are no charged particles moving macroscopic dis¬ 
tances through the material, but there is electric current. The current comes from 
electrons bound in atoms, and its macroscopic effects are described by Jb (x) and 
Kb(x). These effective currents result from adding the atomic currents of many 
atoms in a neighborhood of x. Both the geometric arguments and the rigorous 
calculus proof leading to (9.11) show that the bound volume current density is 
Jb = V x M and the bound surface current density is Kb = M x n. 


Pictorial Description of Bound Current 

Some simple pictures can help to visualize the relation Jb = V x M, which is the 
main principle for understanding the magnetic effects of matter. 
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FIGURE 9.8 Magnetization and bound current. A small cell containing many aligned 
dipoles, and the equivalent effective surface current K = M x n. The magnetization points 
out of the page. 


Figure 9.8 shows a small cell in the material with many aligned dipoles, rep¬ 
resented as small current loops. On an atomic scale the magnetization fluctuates 
wildly with position, but the macroscopic magnetization, i.e., due to the combined 
effects of many atoms, is uniform M = nmk. The same magnetization would be 
produced by a surface current density M x n around the four faces of the cell. 

Figure 9.9 shows 5 contiguous cells with magnetization M z (x )k pointing in 
the z direction with M z increasing in the +x direction. Just by looking at the 
picture we see that M curls counterclockwise around — j . Therefore there must 
be a bound current out of the page (—j direction) with 


z 



X 


FIGURE 9.9 Curl of M is J 5 . The magnetization points in the z direction and increases 
in the +x direction. At the center of each cell a typical atom is represented as a small 
current loop. There is a bound current out of the page. 
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J b = VxM = -^j. (9.20) 

dx 

The figure shows at the center of each cell a representative atomic dipole (there 
are many dipoles in each cell) as a small current loop. (Of course the electrons 
revolve in the opposite direction.) The picture shows why there is a bound current 
out of the page. For example, consider the region near the interface between the 
two right-most cells. The current out of the page on the left side of the cell to 
the right of the interface is greater in magnitude than the current into the page 
on the right side of the cell to the left of the interface—making a net current out 
of the page near the interface. The picture is consistent with (9.20). By a similar 
argument the student can verify that there is a net current out of the page near any 
of the interfaces between neighboring cells. 


9.3 ■ AMPERE'S LAW FOR FREE CURRENTS, AND H 

We have seen that there are two kinds of current in matter, free current Jf and 
bound current Jb. Free current is familiar; it’s the mean translational motion of 
charges that are free to move throughout the material. Any single charge moves in 
an irregular random walk, but summing over the many particles in a neighborhood 
of x creates the current density Jf(x). Bound current is different, and somewhat 
difficult to visualize. It is also due to motion of charge, 7 but the charge is bound 
within atoms or molecules. Orbital motion and spin of electrons contribute to the 
magnetic dipole moment of the atom; and those atomic currents, summed over the 
many atoms in a neighborhood of x, produce a current density Jb(x) = V x M, 
where M(x) is the magnetization. A bound current exists at x if the magnetization 
curls there. The bound current is just as much a source of B as the free current. 
Therefore Ampere’s law in a magnetic material is 

V x B = Mo(Jf + Jb) • (9.21) 

Now, Jb is the curl V x M, and it is convenient to combine the curls in (9.21) by 
writing 


V x H = J f , (9.22) 

where 

H = — - M. (9.23) 

MO 

Equation (9.22) is another form of Ampere’s law. Equation (9.23) defines a new 
field H(x) that is a combination of the magnetic field B(x) and the material field 
M(x). 

7 We’re using poetic license here. It is not rigorously correct to think of electron spin—a quantum 
phenomenon—as motion. 
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The nomenclature for the related fields B(x) and H(x) is not as standardized as 
one might wish. 8 One convention, used for example in the famous graduate text¬ 
book by J. D. Jackson, is to name B the “magnetic induction” and H the “mag¬ 
netic field.” However, B is the fundamental field that exerts the magnetic force on 
charged particles, whereas H is a derived auxiliary quantity that we introduce for 
convenience; so it is preferable to name B the magnetic field, in analogy with E 
the electric field. (This is the term used for B in this book.) An informal language 
is to call the two fields the B-field and the H-field, which is precise but not ele¬ 
gant. A common usage, especially in conversations, is just to call either one the 
magnetic field! 

Terminology aside, B and H are distinct fields. Note that B and H have differ¬ 
ent units. B is measured in teslas, where 1 T=1 N/(Am). H is measured in A/m, 
which doesn’t have a special name. In many cases B and H are proportional. For 
example, in a vacuum B = /iqH. In common materials (diamagnetic or paramag¬ 
netic) they are also proportional, B = n H, in which the parameter /.< is called the 
permeability of the material. In ferromagnets B is a nonlinear function of H, as 
we shall see in Sec. 9.6. 

In certain systems the forms of B and H are quite different, as we shall see 
in some examples. A fundamental principle of magnetism is that V • B = 0, 
expressing that magnetic monopoles—point sources from which B diverges—do 
not exist. In contrast V • H is not necessarily 0. This difference implies that B 
and H satisfy different boundary conditions at an interface between materials: 
The normal component of B is continuous; and, if there is no free surface current, 
the tangential component of H is continuous. We shall derive these boundary 
conditions presently. 9 Example 3 shows how B and H could be measured in a 
magnetic material, in principle. 


EXAMPLE 3 Consider a large piece of material that has a constant magnetic field 
and magnetization, respectively, Bi and Mi, which are parallel to each other; 
(9.23) holds, as always, so Hi = B\/p-o — Mi. 


(a) If a thin disk-shaped cavity is hollowed out of the material with the disk axis 
parallel to Bi, what are the fields Bo and Ho in the cavity near the middle 
of it? 

By symmetry Bo near the middle of the cavity is perpendicular to the faces 
of the disk. The normal component of B is continuous across a boundary so 
Bo = Bi and, from (9.23), Ho = Hi + Mi. (We have used Bo = IA)Hq in the 
cavity.) 

8 The long and interesting history of the magnetic vectors B and H, as well as the subtle concepts 
associated with these quantities, are discussed in Ref. [1]. 

9 See Section 9.3.4. The reader may wish to read Sec. 9.3.4 before Example 3, because the boundary 
conditions must be used in the example. 
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(b) If a thin needle-shaped cavity is hollowed out of the material with the needle 
axis parallel to B i , what are the fields Bo and Ho in the cavity near the middle? 

By symmetry Bo and Ho near the middle of the cavity are parallel to Bi 
and Hi, and tangent to the cavity walls. The tangential component of H is 
continuous across the boundary so in this case Ho = Hi and, from (9.23) 
Bo = Bi — moMi. 

(c) If a small spherical cavity is hollowed out, what are Bo and Ho in the cavity? 

By the superposition principle the field inside the cavity is the sum of the 
field in the bulk material plus the field of a superposed sphere with magnetiza¬ 
tion —Mi. Example 1 shows that the magnetic field in the superposed sphere 
is — |juoMi. Therefore inside the cavity we have 

Bo = Bi — |/uqMi and Ho = Hi + |Mi. 


The results of these three cases are summarized in a table: 


Cavity shape 


Bo 

Ho 

Disk 

Bi 


Hi +Mi 

Sphere 

Bi 

— f/uoMi 

Hi + iMi 

Needle 

Bi 

- moMi 

Hi 


Notice that the fields in the spherical cavity are intermediate between the fields 
in the other cavities. This is expected, because all three cavities are similar to 
special cases of an ellipsoidal cavity, with the sphere intermediate between the 
extremes of disk and needle. (An extremely oblate ellipsoid is disklike, but with 
a smooth surface, and an extremely prolate ellipsoid is needlelike.) The general 
problem of an ellipsoid of magnetic material in an external magnetic field can 
be solved analytically in terms of elliptic integrals. Therefore it is convenient to 
use ellipsoidal samples for measuring magnetic properties. The corresponding 
electrostatics problem of an ellipsoid of dielectric material in an external E can 
be solved similarly. 

Because the field Bo in a disk-shaped cavity is the same as the field Bi in the 
bulk material, and the field Ho in a needle-shaped cavity is the same as the field 
Hi in the bulk. Lord Kelvin introduced these cavities as operational ways of mea¬ 
suring the fields inside a magnetic material. An analogous procedure can be used 
to measure E and D in a dielectric material. 10 This emphasis on the importance of 
direct measurement of physical quantities dates from a time, a hundred years ago, 
when a quantity was only considered meaningful if it could be measured directly. 
But we don’t follow that ideal so closely anymore in physics. 


10 See Exercise 8. 
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9.3.1 ■ The Integral Form of Ampere's Law 

The differential equation (9.22) is equivalent to an integral relation for the circu¬ 
lation of H around a closed curve, i.e., an Amperian loop. By Stokes’s theorem 

(j> H • d£ = j Jf - dA = /f(enclosed). (9.24) 

Here C is any closed curve, S is any open surface bounded by C, and /f(enclosed) 
is the free current through S. The directions of d£ and dA are correlated by the 
right-hand rule, in the usual way for Stokes’s theorem. We have derived (9.24) 
from (9.22) but the derivation can be reversed because (9.24) must hold for arbi¬ 
trary C and S. 

9.3.2 ■ The Constitutive Equation 

The reason (9.22), or (9.24), is so important and useful is that the source term 
is only the free current—the familiar current due to motion of free charges. The 
bound current is hidden in H. But we cannot just ignore the bound current; it has 
physical effects. To use the field H in a calculation of the magnetic properties 
of a system, it is necessary to know from some other information the relation 
between B and H, or equivalently between M and H. The required information is 
a question of materials science. It is determined by the electronic structure of the 
atom. The equation relating B and H is called the constitutive equation. 

In most materials the magnetization M(x) is proportional to the field H(x). 
Such a material is called a linear magnetic material. The constant of proportion¬ 
ality Xm defined by 

M = Xm H, (9.25) 

is called the magnetic susceptibility. Then B is also a linear function of H 

B = ix H, (9.26) 

where the permeability p is 

M = Mo(l + Xm) (9-27) 

by the definition (9.23) of H. 

9.3.3 ■ Magnetic Susceptibilities 

Linear materials are classified as diamagnetic or paramagnetic. The definition of 
a diamagnetic material is that Xm < 0. In this class of materials the magnetization 
points in the direction opposite to the H field, M oc —H. In other words, the 
atomic dipoles are aligned opposite to H and B. Recall from Sec. 9.1 that putting 
an atom in a magnetic field creates a contribution to the magnetic moment, in the 
direction opposite to the field. The classical picture of diamagnetism, described 
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in Sec. 9.1, attributes it to electromagnetic induction, and the direction of the 
induced moment is opposite to the field by Lenz’s law. In diamagnetic materials 
this contribution to the atomic dipole moment is dominant, so M a —H and 
X, n < 0. As Table 9.2 shows, \xm I <3C 1; in this sense diamagnetism is a small 
magnetic effect. 


TABLE 9.2 Magnetic Susceptibilities 
of Some Elements and Compounds 


Diamagnetic elements 

Xm * 

H 2 (STP) 

-2.2 x 10~ 9 

He (STP) 

-1.1 x 10~ 9 

N 2 (STP) 

-6.7 x icr 9 

Si 

-3.3 x 10~ 6 

Ar (STP) 

-1.1 x 10~ 8 

Cu 

-9.6 x 10~ 6 

Xe (STP) 

-2.6 x 1(T 8 

Au 

-3.4 x ltr 5 

Pb 

-1.6 x 10~ 5 

Paramagnetic elements 

Xm 

0 2 (STP) 

+i.9 x ltr 6 

Na 

+8.5 X 10~ 6 

A1 

+2.i x io~ 5 

K 

+5.7 X 10~ 6 

Cr 

+2.9 x 10~ 4 

Rb 

+3.7 x 10~ 6 

W 

+7.0 X 10~ 5 

Nd 

+2.8 x 10- 4 

Gd 

+8.7 x 10“ 3 

Compounds 

Xm 

H 2 0(£, 293 K) 

-9.0 x 10~ 6 

CO (STP) 

-5.5 x 10~ 9 

NO (STP) 

+8.2 x 10~ 7 

C0 2 (STP) 

-1.2 x 10~ 8 

Si0 2 

-1.4 x 10~ 5 


a We use SI units, in which Xm is dimension¬ 
less. There will come a time when compre¬ 
hensive tables of Xm are available in SI units. 
Today, however, the most comprehensive ta¬ 
bles (Handbook of Chemistry and Physics, 
CRC Press, Boca Raton, 1999) list Xm in 
Gaussian units of cm 3 /mol. To convert to SI 
units, multiply by 4n/v m , where v m is the 
molar volume in cm 3 /mol. 
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The definition of a paramagnetic material is that Xm > 0- In this class of ma¬ 
terials the magnetization is in the same direction as the magnetizing field. Recall 
from Sec. 9.1 that an atom with a permanent magnetic moment will tend to align 
with B or H because of the torque m x B on the dipole moment. In paramag¬ 
netic materials the permanent dipole moment is greater than the induced dipole 
moment, so M oc +H and Xm > 0. 

Table 9.2 shows values of susceptibility Xm for some interesting elements and 
compounds; for gases the data are for STP, and for solids the data are for room 
temperature. The elements have been chosen so that there are at least two from 
each row of the periodic table. For each row, except the first, there is shown Xm 
for at least one diamagnetic and one paramagnetic element. (The first row of the 
periodic table contains only the elements hydrogen and helium; both H 2 and He 
are diamagnetic.) Notice that \x m \ for paramagnetic elements is generally larger 
than for diamagnetic elements. But even for paramagnetic materials, \x m \ <SC 1- 

The inert-gas elements are all diamagnetic; notice in Table 9.2 that for He, 
Ar, and Xe, the magnitude of Xm increases as the size of the atom increases. 
This reflects the increase in the factor r 2 in (9.5). The same effect shows up for 
CO and CO 2 ; the triatomic CO 2 molecule is larger than diatomic CO so that, in 
some sense, the average r 2 in (9.5) is larger. In general if an atom has a permanent 
dipole moment then the paramagnetic effect dominates the diamagnetic effect and 
the material is paramagnetic. Table 9.2 shows examples with a large range of x m 
for both diamagnetic and paramagnetic elements. 

The magnetic susceptibilities of the elements vary with position in the periodic 
table in a way that can be explained in terms of elementary atomic structure. 11 


EXAMPLE 4 (a) Sample calculation of diamagnetic Xm for H 2 . Now that we 
have a quantitative measure of magnetic susceptibility, it is instructive to inves¬ 
tigate how well (9.5), the result of our model calculation of diamagnetism, com¬ 
pares to an experimental value. The simplest example is hydrogen. 

Equation (9.5) gives the contribution to the magnetic moment of an atom from 
the change of the orbital motion of one electron in response to an applied field H. 
For an H atom, using r = ao = 5.29 x 10 -11 m (the Bohr radius) and substitut¬ 
ing 12 B = jtroH we obtain 5m = — (e 2 ao 2 M()/4rn e )H. Hydrogen gas is diatomic, 
so there are no data on Xm f° r a g as of H atoms, but as a rough approximation we 
can consider that 1 mole of H 2 molecules consists of 2 moles of H atoms. Then 
the number density of H atoms in H 2 at STP is n — 2N a/2.24 x 10 2 atoms/m 3 . 
The magnetic moment per unit volume—the magnetization—is then 


M = - 


ne 2 a^ii 0 

4m e 


H = XmH, 


(9.28) 


n K. Krane, Modem Physics, 2nded. (Wiley, 1996). 

12 In a magnetic material B = 1 . 1 H, but it is a good approximation to use B = hqH in (9.5) because 
Xm « 1. 



9.3 Ampere's Law for Free Currents, and H 


329 


which implies = —1.3 x 10 -9 for H 2 at STP. (There is no net contribution 
from the electron spins because the total spin is S = 0 in an H 2 molecule.) Con¬ 
sidering the approximations we have made this estimate is gratifyingly close to 
the experimental value —2.2 x 10 -9 shown in Table 9.2. 

(b) Sample calculations of paramagnetic It is also instructive to apply 
(9.6), the result of our model calculation for the magnetization of a paramagnetic 
substance, to experimental values. 

We first apply (9.6), or rather a quantum modification, 13 to NO, which has one 
unpaired electron spin per molecule. The molecular density is n = Na/2.24 x 
10 -2 m -3 , and again we may approximate B by //pH, so 


M = 


0 

kT 


H = XmH, 


(9.29) 


which gives = 7.7 x 10 -7 for NO at STP. This agrees quite well with the 
experimental value 8.2 x 10 -7 in Table 9.2. If we do the same calculation for O 2 
gas, considering that the O 2 molecule has 2 unpaired electrons, we find = 
1.5 x 10~ 6 , which compares favorably to the experimental value for O 2 . 

If we carry out a similar calculation for a metal, say Al, we obtain a value for 
X, n that is about 10 2 too large. The reason for this discrepancy is that in a metal 
the temperature that characterizes the electrons whose spins can line up param- 
agnetically, is not room temperature but a much higher quantum-mechanical tem¬ 
perature called the Fermi temperature. This is another indication that magnetism 
is a quantum phenomenon. There is even a theorem 14 that according to classical 
physics there can be no magnetism. 15 


The classic technique for measuring susceptibility is to measure the force— 
attractive or repulsive, respectively, on a paramagnetic or diamagnetic sample— 
produced by an external magnet, using a sensitive force balance. From the force, 
the magnetization and susceptibility can be calculated. 


9.3.4 ■ Boundary Conditions for Magnetic Fields 

A standard problem in magnetostatics is to calculate the fields for a specified 
system of free currents and magnetic materials. To solve such problems it is nec- 

13 Equation (9.6) is the classical formula for the magnetization. The quantum formula for a spin 1/2 
atom, which we use here, is the classical result multiplied by 3. 

14 The theorem is due to J. H. van Leeuwen. An accessible explanation is in The Feynman Lectures , 
Vol. II, Sec. 34-6. 

15 C. Kittel once put it this way: “.. .and were the value of h to go to zero, the loss of the science of 
magnetism is one of the catastrophes that would overwhelm the universe.” C. Kittel. Introduction to 
Solid State Physics, 4th ed., Chap. 15 (Wiley, New York, 1971). 
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(a) (b) 


FIGURE 9.10 Boundary conditions for magnetic fields. 5 is a surface separating re¬ 
gions with different permeabilities, (a) A Gaussian pill box G cuts through S, with area A 
on opposite sides of S and height h. (b) An Amperian loop C cuts through S, with sections 
T on opposite sides of S and width w. 


essary to know the boundary conditions for the fields. We will derive the general 
boundary conditions from the field equations. 

The field B(x) satisfies V • B = 0 in any magnetic system, because there 
are no magnetic monopoles. This equation implies that the normal component of 
B is continuous across any two-dimensional surface S. Figure 9.10(a) shows a 
Gaussian surface G shaped like a pill box, with similar faces A on opposite sides 
of S, and height h. The flux of B outward through G is 0, by Gauss’s theorem. 
In the limit h —*■ 0 the flux through the edge approaches 0, so the flux through 
the two faces must be equal but opposite. Thus, letting Bi and B 2 be the limits of 
B approaching opposite sides of S, and n the unit normal vector in the direction 
1 —>• 2, Gauss’s law implies 

/ B 2 • hdA = / B] hdA. (9.30) 

Ja Ja 

As this must be true for any area A on S, 

B 2 -n = Bi-n. (9.31) 


B n is continuous. 

The field H(x) satisfies V x H = Jf. This relation implies that the tangential 
components of H are continuous across S if there is no free suiface current. Oth¬ 
erwise, if there is a free surface current density Kf, then H has the discontinuity 
Kf x n. Figure 9.10(b) shows an Amperian loop with width w, whose sides on 
opposite sides of S approach a curve T on S as w —> 0. By Stokes’s theorem, the 
circulation of H around the loop equals the free current enclosed by the loop. In 
the limit w -*■ 0, the circulation approaches the difference of the line integrals of 
Hi and Hi along T, and the enclosed current is the free surface current integrated 
across T (out of the page in Fig. 9.10(b)). Thus by Ampere’s law 
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f (H 2 - Hi) • dt = /r (K ' x n) • dt. (9.32) 

To see that the right-hand side of (9.32) is the surface current passing through the 
loop, consider that the vector Kf x n lies on S and is perpendicular to Kf. Only the 
component of Kf perpendicular to T contributes to the line integral on the right- 
hand side. As (9.32) must hold for any T, but only depends on the components of 
H tangent to S, we may conclude 


H 2 ? - Hi, = Kf x n (9.33) 

where H, denotes the tangential part of H, i.e., H H n n. If there is no free surface 
current, then H, is continuous. The conditions (9.31) and (9.33) are true for any 
surface. The most important applications are where S is an interface between 
different materials. 

We will use these boundary conditions in some interesting magnetic field cal¬ 
culations. 

Boundary Conditions of Magnetic Potentials 

In magnetic field calculations it is often useful to introduce potentials, A(x) or 
dm (x). For any magnetic field we may write B = V x A, because V • B = 0. The 
vector potential A(x) was introduced in Sec. 8.5. The boundary condition on A(x) 
is that the tangential components of A are continuous across S. Consider again the 
Amperian loop in Fig. 9.10(b). By Stokes’s theorem, the circulation of A around 
the loop equals the flux of B through the loop. As w —0 the flux approaches 0 
with the area of the loop, and the circulation approaches f r (A 2 — Ai) • dt\ thus 
the boundary condition on the vector potential is 

A 2 , =Ai,. (9.34) 

For certain problems a scalar potential dm (x) may be employed. In a region where 
there is no free current we may write H = — Vdm, because V x H — 0. (This 
technique will be used in Sec. 9.5.) The boundary condition on dm (x) is that dm 
is continuous across S, 


dm2 — dm\• (9.35) 

It can be shown that the continuity of </> m (x) implies that the tangential compo¬ 
nents of V0 m are continuous, as required by (9.33) with Kf — 0. 16 


9.4 ■ PROBLEMS INVOLVING FREE CURRENTS AND MAGNETIC MATERIALS 

In Examples 1 and 2 we found the B field due to magnetized objects, but with 
no free currents. A sphere or cylinder made of magnetized iron is equivalent to 

16 


'See Exercise 10. 
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Example 1 or 2, respectively. Now that we have developed the mathematical prin¬ 
ciples and physical insight required to find the B and H fields due to free currents, 
we can solve problems involving both currents and linear magnetic materials. Ex¬ 
amples 5 and 6, which have planar and cylindrical symmetry respectively, are 
problems of this kind. 


EXAMPLE 5 In this example we consider an infinite slab of a conducting material 
with magnetic susceptibility x,„, carrying a certain current distribution. Figure 
9.11 shows the system. The slab is parallel to the xy plane, between z — —a and 
z — a. It carries a free volume current density Jf(z) = (Joz/a) i, which is plotted 
in Fig. 9.12. Above the xy plane the current is out of the page, below it is into the 
page, and the integrated current density is 0. Outside the slab is vacuum. What are 
H, M, and B? 

We’ll solve this problem in two ways: first using Ampere’s law and second by 
the superposition principle. In order to use (9.24) effectively we take advantage of 
two symmetry considerations. Notice first that the conducting slab may be consid¬ 
ered as made up of infinitesimal sheets of free current parallel to the xy plane. In 
Example 4 of Chapter 8 we found the magnetic field B due to a current sheet with 
surface current density K i; from Ampere’s law the field is — above 

the sheet and +(im)K /2) j below. We are now interested in H for a current sheet 
with free current per unit length K — K i; by (9.24) 

H(x) = — (AT/2) j above the current sheet, (9.36) 

H(x) = +(K/2) j below the current sheet. (9.37) 

Similarly, the planar symmetry implies for the slab of current of Fig. 9.11 that the 



FIGURE 9.11 Magnetic conducting slab. The slab of material has free current density 
(Joz/a) i, out of the page for z > 0 and into the page for z < 0. The rectangular loop 
TUVWT is used to determine the field H from Ampere’s law. 
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H y (z) 



FIGURE 9.12 Current density J and field H of the magnetic conducting slab. 


field cannot depend on x or y, and H must have the form H(x) = H y (z) j; i.e., H 
has only a y component but it may vary with z. 

The second symmetry consideration is that H = 0 outside the slab, i.e., for 
z > a or z < —a. The reason is that the net current flowing in the slab is zero, 
as is clear from the graph on the left of Fig. 9.12. H(x) from a sheet of current is 
independent of the distance from the sheet, so the field above or below the slab, 
due to the superposition of thin sheets with net current 0, is 0. 

Now apply Ampere’s law to the closed Amperian path TUVWT in Fig. 9.11. 
The only contribution to the line integral comes from VW, whose length we call 
£; H is 0 along TU, and H is perpendicular to UV and WT. Thus Ampere’s law 
states 


from which 


Hy(z)e = J^ ~£dz 


a 


Jo, 


H y (z) = z~(a 2 - z 2 ) for -a<z<a. 
2 a 


(9.38) 


(9.39) 


Equation (9.39) holds throughout the slab because (9.38) would be the same even 
if the Amperian path were extended into the region z < 0. Figure 9.12 shows a 
plot of H y (z) on the right. 

Let us now calculate H y (z) in another way—by integrating the contributions of 
the infinitesimal sheets that make up the entire current flow. The idea here is that 
dH(z) = T(dK / 2) j above or below the sheet, where the sheet current density is 
dK = (Joz'/a)dz'■ For a field point z between —a and a 


Hy(z) = +1 f J -^-dz' - \ T —dz\ (9.40) 

2 J z a 2 J- a a 


which gives again (9.39). Notice that the first term in (9.40), which comes from 
the current above the field point, equals the second term, which comes from the 
current below the field point. 
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Finally we calculate the magnetization, magnetic field, and bound currents. 
Inside the slab, M = / m H and B = fio (1 + / m )H. Outside the slab both M and 
B are zero. The bound surface current density Kb = M x n is 0 at both upper and 
lower slab surfaces because H = 0 there, and therefore M = 0 too. The bound 
volume current density is 

J b= VxM=—i. (9.41) 

a 

Note that Jb is proportional to Jf, so it too is an odd function of z. Therefore the 
net bound current in the slab is zero, like the net free current. 


EXAMPLE 6 What are H, M, and B for the coaxial cable shown in cross section 
in Fig. 9.13(a)? The inner and outer conductors carry equal currents in opposite 
directions. The currents are distributed uniformly in the conductors. The conduc¬ 
tors are separated by an insulator with susceptibility / m . 

Let / be the current, and let the z axis point out of the page. The current den¬ 
sities are 



FIGURE 9.13 (a) Coaxial cylinders separated by a magnetic material. There is cur¬ 
rent out of the page in the inner cylinder (0 < r < a) and into the page in the outer cylinder 
(b < r < c) both distributed uniformly. Between the cylinders (a < r < b) the magnetic 
susceptibility is Xm ■ (b) Magnetic field as a function of radius for the coaxial cable. 
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7 is a free current, and by symmetry H must have the form H = H ( p(r)4>. Now 
we apply Ampere’s law. Taking the line integrals counterclockwise around each 
of the four circular paths shown in Fig. 9.13(a) gives 


-- for r< a (Path 1) 

2na l 

(9.44) 

7 

-— for a 
2 nr 

< r < b (Path 2) 

(9.45) 

I(c 2 -r 2 ) 

for b <r < c (Path 3) 

(9.46) 

2 nr{c 2 — b 2 ) 

0 for r > c 

(Path 4). 

(9.47) 


In deriving these results notice that Path 1 encloses only that part of +/ inside 
radius r. Path 2 encloses all the current +7, Path 3 encloses the current +7 and 
also that part of —7 between b and r, and, finally, the total current enclosed by 
Path 4 is 0. Fig. 9.13(b) shows 

As usual, in the region of the magnetic material a < r < b, the field B is 
W)0 +Xm) H; elsewhere it is just /roH. In the magnetic material the magnetization 
is M = Xm (7/2 nr) (f>; this function is curl free so there is no bound current in the 
volume of the material. There are, however, bound surface currents: At r = a the 
normal vector out of the magnetic material is n = — ?, so Kb = M x (—?) = 
Xm (//2.T«)k; the total bound current at this interface is / m 7 in the +k direction. 
At r = b, where n = r, we have K/, = —Xm(7/2;r&)k; the total bound current at 
this interface is / m 7 in the —k direction. The sum of the bound currents is zero. 


Example 5 is a situation in which there is J/, but no KExample 6 is a situ¬ 
ation in which there is K/, but no J/,. The general rule is that in a uniform linear 
magnetic material there can be bound volume current only in a region where free 
current flows. Consider the curl of both sides of the relation M = x„,H; the re¬ 
sult is 


V x M = / m V x H or Jb = XmJf- (9.48) 


9.5 ■ A MAGNETIC BODY IN AN EXTERNAL FIELD: THE MAGNETIC SCALAR 
POTENTIAL 0 m (x) 

We consider next a class of magnetostatics problems that can be conveniently 
solved by using a magnetic scalar potential <p m (x) which is a solution to Laplace’s 
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equation. This method has the advantage that we’ll be able to bring to bear on 
magnetostatics the considerable mathematical armamentarium we developed for 
electrostatics. 

The situation in magnetostatics is analogous to the electrostatic case so a quick 
review is in order. In electrostatics we always have V x E = 0, which permits 
us to write E = — W, where V is the electrostatic potential. For uniform linear 
dielectrics we also have V • D = eV • E = pf, so that if there is no free charge 

V • E = 0, and then V satisfies Laplace’s equation. 

In magnetostatics we have V x H = Jf, so that if there are no free currents 

V x H = 0. This permits us to introduce the magnetic scalar potential <p m (x) 
where 


H(x) = -V0 m . (9.49) 

We emphasize that the scalar potential (9.49) can only be used in regions of x 
where there is no free current. If the magnetic medium is linear then we also have 
V B = V ( A zH) = 0; and if, in addition, p. is constant in the region of interest, 
then V • H = 0. Combining this with (9.49) gives 

V 2 0 m = 0. (9.50) 

Equations (9.49) and (9.50) taken together with the boundary conditions (9.31) 
and (9.33) will enable us to find 4>„, (x) and the resulting fields. We will now use 
this method to solve two interesting problems. 


EXAMPLE 7 What are the fields inside and outside a sphere of permeability p in 
a uniform applied field B = Bok? 

The sphere, with radius b, and the magnetic field are shown in Fig. 9.14. We 
will proceed in the now familiar way by judiciously guessing the form of <p m (x) 
from the symmetry of the problem. If <p m (x) satisfies (9.50) and the proper bound¬ 
ary conditions then it will give the correct fields, by the uniqueness theorem. But 
if our guess for the form of <p m (x) is wrong, we will not be able to satisfy the 
boundary conditions. 

Notice first that the scalar potential corresponding to the external fi eld alone is 

n Bo Bo 

4>m (x) =-z =- rcosO. (9.51) 

B-0 B-0 

This suggests that <p m (x) may need no other angular dependence than cos 9. The 
only two solutions to Laplace’s equation in spherical coordinates that are propor¬ 
tional to cos 9 are r cos 0 and cos 6/r 2 . The first corresponds to a uniform field, 
and the second to a point dipole at the origin. Thus we are led to try 

cpln'ir, 9) = ar cos 9 for r < b 


(9.52) 
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FIGURE 9.14 Example 7. A solid sphere is placed in a uniform applied magnetic field, 
but the total magnetic field is not uniform because of the effect of the material. The figure 
shows the B-field lines for the case of a paramagnetic sphere. 


B(\ cos 0 

<Pm X {r, 9) = - -r cos9 4- /3—for r > b. (9.53) 

MO r 2 

The scalar potential for a dipole m at the origin is 

^iP°le (x ) = J_ 5LE, (9 .54) 

4rr r z 

so the second term in (9.53) corresponds to a dipole m = 4jr/?k. A dipole term 
cannot be included in (9.52) because it would be singular at the origin. 

The two constants a and ft will now be determined from the two boundary 
conditions. One boundary condition is the continuity of the tangential component 
H e of H at r = b. In terms of the scalar potential this condition is 


i i (WT \ 

b \ d6 )r=t> b ' 96 

which gives 

b 3 a - p = - b 3 B 0 /lM ). 


(9.55) 


(9.56) 


(The same equation could be obtained from the condition that <p m is continuous.) 
The other boundary condition is the continuity of the normal component B r of B 
at r = b. Because B is mH inside the sphere and mqH outside, the condition is 


-M 


dr 


r=b 



(9.57) 


which gives 
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K m b 3 a + 2fi = -b 3 B 0 /vo- 


(9.58) 


In (9.58), K m = fx/fxo is the relative permeability. 

The solutions to (9.56) and (9.58) are 

-3 B 0 (K m - l)B 0 b 3 

a = - and /S =-. 

(K m + 2)fX0 (Km + 2)fX 0 

Finally, substituting these into (9.52) and (9.53) gives the potentials 


(9.59) 


CV, 9 ) = 


—3fl 0 


(K m + 2)/X 0 


-r cos 0, 


Bo 


C(r, 9) = -- r cos 9 + 

VO 


(K m — l)B 0 b 3 COS 6 
(K m + 2)/x 0 r 2 


(9.60) 

(9.61) 


We note again that the magnetic scalar potential is continuous at r = b in accord 
with the boundary condition (9.35). 


The internal fields. The calculations of the fields in the region r < b are 


H int = V<C = 3/i ° k, 

(Km + 2)H0 

(9.62) 

B int = /xH int = 3l<mBo k, 

(Km + 2) 

(9.63) 

(Km + 2)H0 

(9.64) 


The interior fields are uniform, i.e., independent of x, and point in the +k direc¬ 
tion. The magnetization is also uniform. M points in the +k direction if K m > 1 , 
a paramagnet; or, M points in the — k direction if K m < 1, a diamagnet. 

If the sphere were nonmagnetic, i.e., K m = 1, then B int = Bo as expected. A 
paramagnetic sphere has K m > 1 and so B mt > B o; a paramagnet concentrates 
the magnetic field lines. (Figure 9.14 illustrates this case.) As K m increases B int 
increases until, in the limit of large K m , B int ->■ 3Bo. This limit is easy to reach 
because for many ferromagnetic substances K m is larger than 10 3 . A diamagnetic 
sphere has K m < 1 and so B mt < Bo; a diamagnet expels the magnetic field lines. 
The extreme limit /e m —0 is called a perfect diamagnet. In this case the field B int 
is 0, i.e., the magnetic field does not penetrate into the material. A superconductor 
is essentially a perfect diamagnet. 


The external fields. In the external region r > b we have 


B ext = /x 0 H ext 


-Vo^fiT = fi ok + 


Km 1 
Km T 2 



Bo(2t cos 9 + 0sin0). 

(9.65) 
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The first term in (9.65) is the external field that we specified. The second term 
is the field of a point dipole at the origin with m tota i = (4/3)7rfo 3 M, the total 
magnetic moment of the sphere. This shows, as we also found in Example 1, that 
the external field produced by a sphere with constant magnetization is the same 
as that of a point dipole at the origin. 

We now ask a natural question about this problem. All the fields, H, B, and 
M, have zero divergence and zero curl both inside and outside the sphere. Yet the 
fields themselves are nonzero and interesting. What and where are the sources of 
the fields? The answer is that the sources of the applied field are essentially at 
infinity—picture large north and south magnetic poles far away—and the other 
sources are bound surface currents at r = b. 


Magnetic Shielding 

Now we tum to an example with practical importance. It is sometimes desirable to 
produce a magnetic-fi eld-free region, say a laboratory room or a box for electronic 
circuit components. Magnetic shielding means surrounding the volume of interest 
with a layer of material with high permeability /i. A material used for this purpose 
is Mumetal, the alloy Ni 0 . 77 Fe 0 .i 6 Cu 0 . 05 Cr 0.02 with i±/i±o = 10 5 . If /1 is large and 
the layer is thick then any external B concentrates in the magnetic layer so that 
the field inside the cavity can, in principle, be made arbitrarily small. The next 
example illustrates magnetic shielding for spherical geometry. 


EXAMPLE 8 What is the field inside a spherical cavity of radius Ri that is sur¬ 
rounded by a concentric spherical shell of permeability m, if there is an external 
applied field B — B^k? 

Figure 9.15 shows the arrangement. The magnetic layer—the shield—extends 
from radius Ri to R 0 . The problem is to find the field inside radius Rj as a function 
of the relative permeability K m = n/no- 

This problem is similar to Example 7 because it has the same applied field and 
a spherical body. The key difference is that here we must find the magnetic scalar 
potential in three regions. Because the only angular dependence is proportional to 
cos 0 we are now led to try 

4>'md, 6) = ar cos 9 for r < /?, (9.66) 

0m etw ( r > 0) = Y r cos<9 -f & C ° S ® for Ri < r < R 0 (9.67) 

r z 

6) = - — rcosO + for r > R 0 . (9.68) 

MO r l 

The four coefficients a, f),y, and S are determined as before from the boundary 
conditions. First, B r is continuous at the interfaces, r = Ri and r — R 0 , 
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B 


B 


FIGURE 9.15 Example 8: Magnetic shielding. A magnetic material fills a spherical 
shell with inner radius Ri and outer radius R 0 . To what extent does the external field 
penetrate into the interior? 



9C‘ 9^ etw 

-MO —— = —M—5- 

dr dr 

9 ^ex, 

M—9— = -Mo—9— 
dr dr 


at r = R, 
at r = R a . 


Also, Ho is continuous at the interfaces, or equivalently, and more simply, 4> m is 
continuous there, 


= at r = R, 
0„ b , etw - C‘ at r = Ro. 


These conditions give the following four linear equations in the unknowns 


Ri 3 a - K m Ri 3 y+ 2K m 8-0 (9.69) 

Ri 3 a - R t 3 y -8 = 0 (9.70) 

2/3 + K m R 0 3 y - 2x m 8 — -R 0 3 B 0 /^o (9.71) 

— R Q 3 y — 8 — R 0 3 Bq/im). (9.72) 


The solutions are 


-9 K„ 


a — 


(2 K m + l)(K m + 2) — 2 (Rj/R 0 ) (k„ 


_ /Bo\ 

-1) 2 Vmo/ 


(9.73) 


( 2 K m + 1 \ 

K “ ( 3 K m ) a 


(9.74) 
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FIGURE 9.16 Field lines for magnetic shielding. 



+ l)(<c m - 1)(^ 3 
-9 K m 


(9.75) 

(9.76) 


In taking <p m to be solutions of Laplace’s equation we assumed that V x H = 0 
and V • H = 0. Because there is no free current it is clear that V x H = 0. 
Straightforward calculation from the fields in the three regions shows that V • H = 
0 everywhere, as well. 

For the sake of completeness all the coefficients have been given. But we are 
particularly interested in a, because the field in the cavity is B int = —/loak = 
B' nt k. In the limit of large K m we have, from (9.73), 



FIGURE 9.17 Field in a shield. The plot shows B z (r)/B q on the xy plane for Example 
8, with n/fM) = 20. Here So is the asymptotic field strength. Note that the field inside the 
cavity is smaller than the asymptotic field. In the limit /.t —> oo the interior field —> 0. 



342 


Chapter 9 Magnetic Fields and Matter 


B 


int 

z 


9 Bq Bq 

2fc m [l — (Rj / Rg)^ 1 ] K m 


(for Km » 1). 


(9.77) 


Equation (9.77) means that if K m is large then the interior region r < R, is mag¬ 
netically shielded from external fields. Figure 9.16 illustrates how the field lines 
of the external field concentrate in the magnetic shield. As a numerical example of 
the effectiveness of this shielding: The magnetic field inside a room whose sides 
are about 4 m, and which is shielded by a 1 cm thick layer of material whose 
K m = 10 4 , would be about 0.03 of the field outside. 

Figure 9.17 shows B z (r) on the midplane (z = 0) of the sphere, for ii/iM) = 
20, showing quantitatively how B concentrates in the shield. 


9.6 ■ FERROMAGNETISM 

Ferromagnetism is the magnetic phenomenon most familiar from everyday ex¬ 
perience. Horseshoe magnets, bar magnets, refrigerator magnets, and compass 
needles are all ferromagnets. Ferromagnets are also key components in motors 
and generators, transformers, loudspeakers, microwave ovens, and other common 
household and workplace devices. 

Ferromagnetic materials have a wide range of physical properties, but one key 
characteristic is that they are easy to magnetize. Even a small magnetic field ap¬ 
plied to a ferromagnet can result in its developing a large magnetization. A ferro- 
magnet can have large magnetization even in the absence of an externally applied 
field—that’s what a permanent magnet is. 

Only three elements, Fe, Co, and Ni, are ferromagnetic at room temperature, 
although many alloys are ferromagnetic at room temperature and several elements 
become ferromagnetic at lower temperatures. Indeed, one of the important charac¬ 
teristics of ferromagnetism is its temperature dependence. Each ferromagnet has 
a characteristic temperature, called the Curie temperature (7c), above which it is 
no longer ferromagnetic but becomes paramagnetic. For Fe, Tq — 1043 K; for 
Co, T(- — 1388 K, and for Ni, T(- — 627 K. Two elements that are ferromagnetic 
below room temperature are Gd (7c = 292 K) and Dy (7c = 88 K). A ferro¬ 
magnetic material undergoes a magnetic phase transformation—a phenomenon 
of great interest in statistical mechanics—at the Curie temperature; i.e., its mag¬ 
netization behaves in a characteristic way as T approaches T(- from below and 
its susceptibility behaves in a characteristic way as T approaches Tq from above. 
This contrasts with the temperature dependences of diamagnetism and paramag¬ 
netism, which were discussed previously. 

What is the microscopic source of the large magnetization of ferromagnets? 
Why is the magnetization permanently retained even in the absence of an exter¬ 
nally applied field? Why does ferromagnetism disappear when the temperature is 
raised? 
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The microscopic sources of magnetization in ferromagnets are the individual 
atoms, which have permanent magnetic moments due to unpaired spins. An iron 
atom, for example, has two unpaired electron spins, in the 3d shell, so that the 
atom has a net magnetic moment of 2/z fl . If the magnetic moments in neighboring 
atoms interacted only via the classical dipole-dipole interaction, then they would 
align themselves antiparallel, like this, tl, because that is the lowest energy state 
for two classical dipoles. However, in a ferromagnetic material there is a much 
stronger interaction between neighboring atoms, called the exchange interaction, 
and of quantum mechanical origin, with the property that the lowest energy state 
is for the atomic magnetic moments to be parallel, like this, tt- 

A macroscopic piece of unmagnetized Fe consists of many magnetic domains, 
each containing from 10 17 to 10 21 atoms. In each domain essentially all the Fe 
atoms have their moments aligned, even at room temperature. Even a single crys¬ 
tal of Fe has many domains of this kind. If the surface of the material is suitably 
etched the domains, which have volumes of 10 -8 to 10 -12 m 3 , can be seen un¬ 
der a low power microscope. An unmagnetized piece of Fe has no net magnetic 
moment because the domains are randomly oriented. It is interesting to calculate 
the magnetization M in a domain. The number density of Fe atoms in the solid is 
n = 8.5 x 10 28 m~ 3 so that M = 2g fl n = 1.6x 10 6 A/m. This corresponds to a 
large internal magnetic field as we can see by finding the field inside an Fe sphere 
uniformly magnetized to the saturation magnetization. That field, according to 
(9.17) of Example 1, is B = \hqM = 1.3 T. 

When an external H field is applied to unmagnetized Fe, the domains gradu¬ 
ally become aligned in the direction of the field. (If the Fe sample is placed in a 
detection coil with an audio amplifier, the flipping of domains is audible, a demon¬ 
stration known as the Barkhausen effect.) More favorably oriented domains grow 
at the expense of less favorably oriented ones as the boundaries between domains 
move, and unaligned domains can rotate in the direction of the applied field. In 
single crystals the alignment occurs easily, but generally domain growth can be 
slow, depending on the grain structure of the sample. As a result of the orientation 
of domains, the piece of Fe becomes magnetized in the direction of the applied 
field. This magnetized state is energetically favored because of the exchange inter¬ 
action, so the Fe remains magnetized when the external field is removed. Because 
ferromagnetism is a result of magnetic ordering it is not surprising that raising the 
temperature sufficiently destroys ferromagnetism. If kT is large compared to the 
energy of the exchange interaction then thermal fluctuations eliminate the align¬ 
ment. The precise calculation of Curie temperatures and magnetic behavior near 
Tc is a large and complex subject. 

9.6.1 ■ Measuring Magnetization Curves: The Rowland Ring 

To characterize the properties of a ferromagnetic material, we measure its mag¬ 
netization curve, which is the curve of M versus H or, more commonly, B versus 
HoH. These are equivalent ways to express the properties of the magnetic material 
because the quantities are related by B = /xq(H + M), according to (9.23). 
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FIGURE 9.18 A Rowland ring for measuring magnetization curves. When the current 
in the primary windings is increased incrementally there are corresponding incremental 
increases A H and All in the toroid of magnetic material. The amount of charge that flows 
through a galvanometer is proportional to AS. 


Figure 9.18 shows a Rowland ring, a standard device for measuring magnetiza¬ 
tion curves. The toroid of magnetic material is wrapped with N p turns of primary 
windings. A current / p is sent through them. That produces in the material the field 
H = HO = (/j,()NpIp/27TR)6, by Ampere’s law. N s turns of secondary windings 
are also wrapped around the toroidal core, which are connected to a galvanometer. 

The principle involved in measuring magnetization curves with a Rowland ring 
is that H is applied by the experimenter —H is the independent variable—and in 
response to H the material develops a magnetization M and an internal magnetic 
field B. Assume the material is isotropic so that all these fields are parallel, in the 
azimuthal direction. In practice H is incremented in known steps A H, by chang¬ 
ing I p in steps A/ p . This produces changes Afi of the magnetic field inside the 
material. What is actually measured is the amount of charge A Q that is driven 
through the galvanometer by electromagnetic induction as a result of the change 
Afi of magnetic field. We defer explanation of electromagnetic induction until 
Chapter 10 but anticipate the result (Exercise 30 of Chap. 10), which is that Afi 
is proportional to A Q and can therefore be determined from the measurement of 
A Q. Thus, one starts with H = 0 and an unmagnetized sample. The magneti¬ 
zation curve is then built up by measuring Afi/ AH for the incremental changes 
described. Magnetization curves are usually plotted as B versus no H, because 
then both ordinate and abscissa have the same units. This choice of axes has the 
advantage that the slope of a straight line, or the derivative of the magnetization 
curve, is dimensionless. Notice that if the core material were removed from the 
Rowland ring then the magnetic field would be no H in the empty toroid. 
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FIGURE 9.19 Ferromagnetic behavior. The magnetic field B is plotted versus /xq// in 
the material. Some important parameters are the saturation magnetic field B s , the remanent 
field B r at hqH = 0, and the coercive force H c . 


9.6.2 ■ Magnetization Curves of Ferromagnetic Materials 

For diamagnetic and paramagnetic materials the magnetization curves are straight 
lines, with slope of B versus fioH equal to k,„ = n/Ha = 1 + x,„, where K m is 
the relative permeability, a dimensionless quantity. For paramagnets K m is slightly 
larger than 1 and for diamagnets it is slightly less than 1, as we saw in Table 9.2. 
The magnetization curves are straight lines because in these linear materials B is 
directly proportional to H. In other words, the susceptibility Xm and therefore the 
permeability /x are independent of H. 

Ferromagnetic materials are nonlinear so their magnetization curves are com¬ 
plicated. Figure 9.19 shows a schematic plot of B versus hqH for a typical ferro¬ 
magnetic material. Starting at the origin with an unmagnetized sample, the mag¬ 
netization curve follows the path 0 a as the experimenter increases H. The ini¬ 
tial increase in B is the result of domain growth. Favorably oriented magnetic 
domains, i.e., those with M pointing in or near the direction of H, grow at the 
expense of less favorably oriented ones, and larger domains grow at the expense 
of smaller ones. The curve eventually flattens out as it approaches a, where the 
material reaches its saturation magnetization fi s ; the corresponding abscissa at 
saturation is noH^. Further increase in H does not result in increasing magne¬ 
tization. The reason the curve flattens out approaching saturation is because the 
dominant mechanism in this region is rotation of domains to line up more closely 
with H, a difficult process. 

If the experimenter then decreases H, the magnetization decreases but the de¬ 
pendence follows a different path—the path ab —to the point where H = 0. At 
H = 0 the material is left with the remanent field B r . This B x is the field of 
the magnetization that remains in the sample with zero applied field. For a good 
permanent magnet B r is large, almost as large as B s . Continuing from b to c, H 
is becoming more negative, i.e., it is increasing in the direction opposite to its 
initial direction, so that the sample is becoming demagnetized. When H reaches 
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the value H c , B is zero. H c is called the “coercive force,” although its units are 
not those of force but rather A/m. If H is then further increased in the negative 
direction the curve eventually reaches the point d, at which the sample is again 
magnetized to saturation but this time in the opposite direction from that at a. 

The entire cycle abcdefa is called the hysteresis loop for the material. If H is 
varied slowly enough then the path defa is symmetric with the path abed, and the 
loop can be reproducibly retraced many times. The area inside the loop, <£ B • r/H, 
is the work that must be supplied to bring unit volume (1 m 3 ) of the material 
through a cycle. This work goes into heating the magnet. 

Figure 9.19 is a qualitative schematic drawing. But it is important to know that 
the magnetic properties of real ferromagnets vary over a wide range. This subject 
is large, of course, but three examples in the table below give an illustration of the 
range. 


Material 

B s 

H c 

Energy loss per 
hysteresis cycle 

Commercial Fe (99.8% pure) 

2.2 T 

80 A/m 

250 J/m 3 

Fe (99.95% pure) 

2.2 T 

4 A/m 

30 J/m 3 

Supermalloy (Ni, Fe, Mo) 

0.8 T 

0.3 A/m 

2 J/m 3 


Ref.: AIP Handbook, Table 5f-l 1, 3rd ed„ 1982. 


It is also important to know that the range of the horizontal axis (poH) is much 
smaller than the range of the vertical axis ( B ), because in a ferromagnet, a little 
H goes a long way toward producing magnetization. 

The quantitative properties of ferromagnetic materials are important in the de¬ 
sign of applications. For the cores of transformers, ferromagnetic materials are 
used for which H c is small (1-10 2 A/m) and for which the energy loss per cycle 
is small. The idea is that the internal field can be easily reversed and the trans¬ 
former can be run at, say, 60 Hz, without overheating the core. Hysteresis curves 
of such materials are very narrow. For the core of relay solenoids, ferromagnetic 
materials are chosen that have a small B r so that when the current in the energiz¬ 
ing coil is turned off the relay will open easily; materials with this property are 
called “soft” magnets. For permanent magnets the desirable properties are large 
B r and large H c (10 3 —10 5 A/m). Materials with large B r are called “hard” mag¬ 
nets. A large H c prevents the magnetization from being disturbed by stray fields, 
a property that is important in magnetic tapes, for example, on which information 
is stored in small ferromagnets with large H c . 

9.6.3 ■ The Permeability of a Ferromagnetic Material 

A ferromagnetic material is often characterized by its permeability p or its relative 
permeability x m = p/po- For ferromagnetic materials, p and x m are functions of 
H. This is in contrast to diamagnetic and paramagnetic materials, in which they 
are constants independent of H. 
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As we’ve learned from Fig. 9.19 and the associated discussion, the B versus 
jt()H magnetization curves of ferromagnets have complicated shapes, of the form 
B = B(H). For ferromagnets we define i-t(H) and K m (H) by 




B(H) 

H 


, B(H) 

and K m (H) = --. 

Mo H 


(9.78) 


Therefore, in Fig. 9.19, K m is the slope of the straight line drawn from the origin to 
the point (/ao H, B). Correspondingly, if we had plotted the magnetization curve 
as B versus H, the slope of the straight line drawn from the origin to the point 
(H, B) would be /.a. To characterize fully the permeability of a ferromagnet would 
require the whole permeability function, but in practice the initial and maximum 
values of K m are given for the path Oa, over which the ferromagnet is initially 
magnetized. For the three materials in the table above these quantities are: 


Material 

K m (initial) 

/c m (rnax) 

Commercial iron 

200 

5,000 

Pure iron 

10,000 

200,000 

Supermalloy 

10 5 

10 6 


In some contexts /.a is defined as AB/AH, where AH is a small change in 
H. As we saw previously, this ratio is the quantity measured with a Rowland 
ring. For diamagnetic and paramagnetic materials this definition is equivalent to 
that in (9.78). But for ferromagnets the two definitions are very different things. 
Forexample, at saturation B(H)/H is usually very large, but AB/AH, which is 
essentially the slope at saturation, is near zero. 

Figure 9.20 shows actual magnetization curves for some ferromagnetic ma¬ 
terials. This plot shows log B versus log H, so that K m is constant, as labeled, 



//[A/m] 

FIGURE 9.20 Magnetization curves of some ferromagnetic materials, (a) Supermalloy, 
(b) Mumetal, (c) Pure Iron Sheet. The diagonal lines correspond to constant values of the 
relative permeability K m = /a//aq. 
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FIGURE 9.21 Example 9. A pencil of Mumetal in a solenoid. 


on the dashed straight lines. This plot also shows clearly how K m varies as the 
field changes. For example, for Mumetal (curve b), K m starts at 30,000, then in¬ 
creases to 100,000, and then decreases to below 1,000. Notice also the difference 
in ranges on the abscissa and ordinate, mentioned previously; the value H = 10 
A/m corresponds to the value noH = An x 10 6 T. 

We now give two examples which use these magnetization curves on prototype 
problems of magnet design. 


EXAMPLE 9 A long cylindrical solenoid of radius a = 5 cm is wound with 
n — 200 tums/m of wire carrying current 0.1 A. There is a cylindrical core of 
Mumetal with radius b — 1 cm on the axis of the solenoid. What are H, B, and M 
everywhere? Also, what is the magnetic flux through a cross section of the core, 
and through a cross section of the solenoid outside the core? 

The system is shown in Fig. 9.21. All the fields point in the z direction, by 
symmetry. Outside the solenoid H, B, and M are 0. 

Everywhere inside the solenoid (0 < r < a) we have H — nl — 20 A/m, by 
Ampere’s law. In the air-filled region (b < r < a) B\ = noH — 2.5 x 10~ 5 T, 
and M\ — 0. 

Inside the Mumetal (0 < r < b) we read from the magnetization curve in 
Fig. 9.20 that B 2 — 0.45 T. Note that the solenoid is operating near the saturation 
region of Mumetal. Then M 2 = Z^/mo — H — 3.6 x 10 5 A/m. 

The magnetic flux through a cross section of the Mumetal is <t >2 — 8211 b 1 — 
1.4 x 10 -4 Wb. The flux through a cross section of the rest of the solenoid is <t> 1 = 
B\n(a 2 — b 2 ) = 1.9x 10 _7 Wb. We see that the magnetic flux is overwhelmingly 
concentrated in the magnetic material, even though the Mumetal is only a thin 
stick of a core. This same effect occurs in magnetic shielding. 


EXAMPLE 10 Electromagnet design. Figure 9.22 shows a symmetric electro¬ 
magnet with an iron yoke wound with two identical coils, each with N = 500 
turns carrying current I. What I is needed to produce a field in the gap of B g — 
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FIGURE 9.22 Example 10. A magnetic circuit. 


1.0 T if the gap width is w = 1.0 cm? What / would be needed for the same B g if 
w = 0.5 cm? 

Our strategy will be to apply Ampere’s law to the path abcda. Let H be de¬ 
noted by Hf along the segments ab , be, and cd ; by H g in the gap; and by H' f 
along the part of da not in the gap. Then 

<f U-d£ = (0.30 m)H F + (0.10m- w)H’ f + H g w = 500 7. (9.79) 

Jabcda 

The integral on the left is called the magnetomotive force (mmf) although it is not 
a force because its unit is the ampere. In this expression w is known; Hf, H' F , 
and H g will be found from magnetic properties and symmetry; we seek I. 

In the gap H g = B g /fi o = 8.0 x 10 5 A/m. Because the normal component of 
B is continuous at the gap surfaces, B' F = B g = 1.0 T. Reading from the curve 
for pure iron in Fig. 9.20 we find that H' f = 500 A/m. Because of the bilateral 
symmetry we reason that half the magnetic flux in da comes from each side of 
the yoke, so Bf = \B' F = 0.50T. Reading again from Fig. 9.20 we find that 
Hf = 185 A/m. Substituting these values into (9.79) gives 

(185)(0.30) + (500)(0.10 - w) + 8.0 x 10 5 u; = 500/ 

with winm and / in A. The dominant term on the left is the one from the path 
through the gap; we say that “the gap has much the largest contribution to the 
mmf.” 

For w = 1.0 cm, / must be 16 A. For w = 0.5 cm, half as wide, / must be 
8.2 A. 

In this calculation we tacitly made several simplifications, e.g., neglecting leak¬ 
age of magnetic flux at the gap, and assuming that the field is uniform across cross 
sections of the yoke. It is not hard to imagine that without such simplifications, or 
without the bilateral symmetry, this would be a much harder calculation. 

The general magnet-design problem is to construct a magnet whose field has 
a given configuration. Such magnets are important for many applications, e.g., 
industrial, medical, particle accelerators and detectors, laboratory experiments, 
and magnetic confinement of plasmas. The principles involved are the ones we 
have discussed, but the details required for an application can make the problem 
very difficult. 
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FURTHER READING 

1. J. J. Roche, Am. J. Phys. 68, 438-449 (2000). 

2. The Feynman Lectures, Vol. II, Chapter 34. 

3. Magnetism in matter is a major topic in solid-state physics, discussed in textbooks of 
that subject, such as C. Kittel, Introduction to Solid State Physics, 7th ed. (Wiley, New 
York, 1996). 


EXERCISES 

Sec. 9.1. Atomic Dipoles 

9.1. In the Bohr model the radius of the first Bohr orbit is 0.529 x 10“ 10 m. Calculate 
the orbital frequency if the orbital magnetic moment is 1 Bohr magneton. Compare 
the corresponding electron speed to the speed of light. [Answer: 1/137] 

Sec. 9.2. Magnetization and Bound Current 

9.2. Use a computer to plot the field B{z) on the axis of a uniformly magnetized cylinder 
with M = M ok, as a function of z/a, where the cylinder has radius a and length 3 a. 

Sec. 9.3. Ampere’s Law and H; Susceptibilities 

9.3. Estimate the maximum magnetization (M) of iron. Assume the atomic dipole mo¬ 
ment of an iron atom is due to two aligned (unpaired) electron spins. The mass 
density of iron is 7.87 x 10 3 kg/m 3 , and the atomic mass is 55.85 u. 

9.4. A long cylindrical bar magnet has a uniform magnetization 5 x 10 5 A/m. What is 
B in the middle of the bar? (The result is the same for any cross-sectional shape.) 
What is B at the end of the bar if the cross section of the cylinder is a circle? (Hint: 
See Chapter 8, Exercise 13.) 

9.5. A simple theoretical model of the susceptibility /m of a diamagnetic material, based 
on (9.5), is 

ne^ri 

Xm = 7—% 0 Z, 

where ro is the RMS atomic radius, n is the atomic density of the material, and Z 
is the number of electrons in the atom. (The factor of 2/3 relative to (9.5) is because 
not all electron orbits are orthogonal to the field.) For this model calculate r$ for 
copper, which has mass density 8.92 x 10 3 kg/m 3 and atomic mass 63.55 u. Does 
your result make sense? 

9.6. A simple theoretical model of the susceptibility Xm of a paramagnetic material, 
based on (9.6), is 



Xm = M0« 
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where m is the permanent dipole moment of the atom. Other parameters are as in 
the previous exercise. From this relation calculate m for aluminum, which has mass 
density 2.70 x 10 3 kg/m 3 and atomic mass 26.98 u. Assume rg is 0.5 x 10 -lo m. 
Express the result in Bohr magnetons. Does your result make sense? What should 
be used for the temperature T1 

9.7. The magnetic susceptibilities of the first three noble gases are Xm(He) = —1.1 x 
10~ 9 , Xm(Ne) = —3.9 x 10 -9 , and Xm(Ar) = —1.1 x 10 -8 . The magnetic 
susceptibilities of the first three rare earth elements are Xm(La) = 5.33 x 10 -5 , 
X,«(Ce) = 1.50 x 10~ 3 , and Xm(Pr) = 3.34 x 10 -3 . Explain these results. 

9.8. Consider a large piece of dielectric material that has a constant electric field E| 
and polarization Pj, which are parallel to each other; as always (6.24) holds, D] = 

«oEi + Pi- 

(a) If a thin disk-shaped cavity is hollowed out of the material with the disk axis 
parallel to Ej, what are the fields Eo and Dp in the cavity near the middle of it? 

(b) If a thin needle-shaped cavity is hollowed out of the material with the needle 
axis parallel to Ej, what are the fields Eg and Dg in the cavity near the middle? 

(c) If a small spherical cavity is hollowed out, what are Eo and Do in the cavity? 
Summarize your results by completing the table: 


Cavity shape 

E 0 

Do 

Disk 



Sphere 

Needle 


Dl - |Pl 


This is a combination of Example 1 of Chapter 6 and Example 3 of Chapter 9. It is 
interesting to compare the results of this exercise to the table in Example 3. Why are 
the results symmetric? 

9.9. The magnetic field of a long solenoid, far from the ends, is 0 outside and fiQiiIk 
inside. Prove that this field satisfies the appropriate field equations and boundary 
conditions. 

9.10. Show that if (pm ( x ) is continuous at a surface S, where <p m is some scalar function, 
then the tangential components of \<p m are continuous at S. 

Sec. 9.4. Free Current and Materials 

9.11. Calculate V x H for the field in Example 5, and sketch the field lines. 

9.12. A conducting slab, parallel to the xy plane and extending from z = — a to z = a, 
carries a uniform free current density Jf = ,/q i. The magnetic susceptibility is 0 
in the slab and Xm outside. Determine the magnetic field and the bound current 
distribution. 

9.13. Consider an iron ring (torus) with the radius of the ring R = 0.2 m and the radius 
of the cross section of the ring r = 1 cm. Around the ring there are 1000 turns of 
copper wire carrying 1 A of current. Determine the magnetic fields H and B, and the 
magnetization M. Take the magnetic susceptibility of the iron to be 5.5 x 10 3 . 
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9.14. A long straight wire of circular cross section with radius ro carries current I and 
is immersed in a large volume of water. The wire is nonmagnetic. Calculate the 
magnetization M(x) in the water. (The susceptibility is —9.0 x 10 -6 .) What is the 
bound current density Jb(x) for r > zq? Calculate the total current (free plus bound). 
[Answer: / (I + Xm)J 

9.15. Two thin-walled long coaxial cylinders with radii a and b carry equal but oppo¬ 
site currents ±7 parallel to the axis. The surface current densities are I/2na and 
—I/lttb. Between the cylinders is a material with magnetic susceptibility Xm- De¬ 
termine the field H(r, <j>) and magnetization M(r, <j>). Plot H^ir). Describe in words 
and pictures the bound currents. 


Sec. 9.5. Body in a Magnetic Field 

9.16. Suppose in Example 7 the region inside the sphere has susceptibility xi and the 
region outside the sphere has susceptibility X2- Then solve the boundary value prob¬ 
lem. 

9.17. (a) A long copper cylinder is placed in an applied magnetic field 7t()k parallel to the 

axis, with Bq = IT. Find the magnetic field B inside and outside the cylinder, 
and the bound surface current density Kj,. 

(b) Now suppose the applied field is perpendicular to the axis of the cylinder in the 
x direction, Bq i with Bq = 1 T. Find the magnetic field B inside and outside the 
cylinder if the radius of the cylinder is a = 1 m. (Hint: Use the magnetic scalar 
potential in cylindrical coordinates.) 

Sec. 9.6. Ferromagnetism 

9.18. Figure 9.23 shows a toroidal solenoid with N turns and current /, with an iron core. 
For the magnetic condition of the iron the relative permeability is K m = m/Mo- The 
cross section of the torus is small compared to the radius r. The core has length 


N turns 



FIGURE 9.23 Iron toroid with an air gap. 
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2nr minus a small air gap of length *. Because x is small, we may approximate 
the magnetic field in the torus to be azimuthal around the axis of symmetry with no 
fringing at the gap. The problem is to find the magnetic field Bg in the gap. 

(a) Show from Ampere’s law that LH\+ xHg = N I, where L = 2nr — x and the 
subscripts i and g refer to the iron and the gap. 

(b) Show that because B n is continuous, = K m H\. 

(c) Show that 


B g = 


K m {L+x) 

- DC). 

L + K m X 


where Bq is the field if there is no iron core. 

(d) Suppose K m = 10 3 , r = 10cm, and Nl = 10 3 A. Evaluate Bg for* = 1 mm, 
and for* = 1 cm. What is B in the toroid if the core is iron and there is no gap? 
What is B in the air if there is no core? 


General Exercises 

9.19. (a) Using classical statistical mechanics for a gas of magnetic dipoles (moments mo 

and density n) free to rotate in any direction in a magnetic field B()k, show that 
the magnetization is 

M = nm o (coth b — 1 / b ) k, 

where b = m^Bo/kT and T is the temperature. This result is called the 
Langevin formula. Plot the magnetization M z as a function of b, in units of 
nmQ. Show that for kT 2> moBo the susceptibility approaches nm^ii()/3kT. 
(b) Derive the magnetization assuming instead that the atoms have spin 1/2, i.e., two 
states t and ! with magnetic moments ±pb- 

9.20. (a) If an atom with one unpaired electron spin is placed in a magnetic field Bok 

then there are two energy levels, corresponding to the two values of the quantum 
number M$. Calculate the energy difference A E between the states. 

(b) If the atom is irradiated it will absorb energy if the frequency v is A E / h. This 
phenomenon is called electron spin resonance. Calculate the frequency if Bo is 
2 T. In what part of the electromagnetic spectrum is this frequency? 

9.21. In the Bohr model of the atom an electron circles the nucleus with radius r, speed 
v, and angular momentum L = mvr = h/2n, where h is Planck’s constant. Find 
the magnetic moment in SI units. As a simple model of a ferromagnetic domain, 
assume the atomic moments (with the value just calculated) are aligned and located 
at the vertices of a cubic lattice with lattice spacing 3 Angstroms. Calculate the 
magnetization. 

9.22. Is the following statement true or false? In a uniform material with magnetic suscep¬ 
tibility Xm and electric conductivity 0, any bound current distribution can only be a 
surface current (assume no time dependence). 
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9.23. A long straight wire carrying current / is parallel to the z axis and passes through 
the point (a, 0, 0). The region x > 0 is vacuum, and the region x < 0 is a material 
with magnetic susceptibility Xm ■ 

Show that the H-field in the region x < 0 is the same as H that would be produced 
by a current 21/(2 -f Xm) in the wire with the material everywhere; and the H-field 
in the region x > 0 is the same as H that would be produced by the combination of 
the current / in the wire and a current Xml/C + Xm) along the line parallel to the z 
axis through (—a, 0, 0), with vacuum everywhere. (Hint: Appeal to the uniqueness 
theorem. What are the boundary conditions?) 

Calculate the force per unit length on the wire. 

Computer Exercise 

9.24. Use a computer to plot the magnetic field strength at the center of the sphere in 
Example 8—magnetically shielded by a shell of material—as a function of h/hq. 
Let Bq = 1 T and R 0 /Ri = 1.5. 



CHAPTER 



Electromagnetic Induction 


“I am busy just now again on electro-magnetism, and think I have got hold 
of a good thing, but can’t say. It may be a weed instead of a fish that, after 

all my labor, I may at last pull up.” 

Michael Faraday, prior to his discovery of electromagnetic induction 


So far we have studied only fields that are independent of time. Now we will be 
concerned with electric and magnetic fields, E(x, t) and B(x, t), that vary with 
time. The time dependence produces physical phenomena that are not present for 
static fields —electromagnetic induction and the displacement current. 

Electromagnetic induction is the production of an electric field by a magnetic 
field that varies with time. Its discovery is generally credited to Faraday, in 1831; 
but there is evidence that it was discovered independently in 1830 by Joseph 
Henry. Before the experiments of Faraday and Henry it was known, from the 
work of Oersted in 1820, that a magnetic field is produced by a steady electric 
current, i.e., a current that is constant in time. Therefore Faraday first attempted 
to detect an electric field from a strong steady magnetic field. This idea proved 
to be wrong. But Faraday, who was a skilled experimenter, did notice a different 
effect: When the magnetic field changes in the region of a loop of wire, an electric 
current occurs, as indicated by deflection of a galvanometer needle. The essential 
idea of electromagnetic induction is that an electric field is produced by nonzero 
3B/3f. 

It is natural to ask whether the situation is symmetric: Does nonzero 3E/3f 
produce a magnetic field? The answer, whose consequences will be discussed in 
Chapter 11 in connection with the displacement current, is yes. 

From the scientific discoveries of Faraday and Henry—small, esoteric effects 
at the time of their discovery 1 —technology has developed that has changed the 
human condition. Electromagnetic induction is the basis for the generation of 
electricity that we all use in daily life. The goal of this chapter is to describe the 
mathematical theory of electromagnetic induction and some of its applications. 

1 At a public lecture, Faraday was asked: “Mr. Faraday, of what use are your discoveries?” His answer: 
“Madam, of what use is a baby?” 
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10.1 ■ MOTIONAL EMF 

As a preliminary step it is useful to study motional EMF —the EMF due to motion 
of a conductor in a magnetic field. 

10.1.1 ■ Electromotive Force 

The most general definition of electromotive force (EMF) is that the EMF £ along 
a curve T due to a force F acting on charges is the work per unit charge that would 
be done by F on a test charge q that moves along T; i.e., £ = / F • dl/q. The 
term “electromotive force” is a misnomer, because £ is not a force; it is the work 
per unit charge, with units of volts. So, for example, the “EMF of a battery” is the 
energy per unit charge supplied by the battery to transport positive ions from the 
cathode to the anode within the battery, or equivalently the work per unit charge 
done by the electric field on an electron that moves around the wire from the 
cathode to the anode. 

In this chapter we are concerned more specifically with circuital EMF, i.e., 
EMF for a closed curve, or loop. From now on, EMF will mean electromotive 
force around a circuit. Again, the definition is the work per unit charge on a test 
charge that is carried around the circuit, 

f F dl 

£=& —, ( 10 . 1 ) 
Jc q 

where F is the force on q. Here C may be any closed curve in space, not neces¬ 
sarily occupied by a conducting material. 

The general electromagnetic force on a test charge q is the Lorentz force 
F = q(E + v x B). In this first section we are mainly concerned with the con¬ 
tribution to the EMF (10.1) that comes from the magnetic term q\ x B in F. 
This contribution is called the motional EMF, which may be nonzero if the circuit 
moves in a magnetic field. 

The electric term q E in F needs a little explanation now, in anticipation of a 
much fuller discussion later. There are two sources for E, and they play different 
roles with respect to EMF. The two sources are electrostatic, due to charges at 
rest or at least moving slowly enough to make the quasistatic approximation, and 
inductive, due to 3B/31 . There is no EMF around a closed loop associated with 
the electrostatic field because V x E = 0 for static charges; by Stokes’s theorem, 

<j> E • dl = J V x E • dA = 0. (10.2) 

Indeed, generally, the EMF from any conservative force is 0, because a conserva¬ 
tive force has, by definition, § F • dl = 0. There is however an EMF associated 
with the inductive part of E, which, as we shall see, has V xE/0. 

But first we ask: What is the EMF from the magnetic force when the circuit 
moves in a magnetic field? 
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10.1.2 ■ EMF from Motion in B 


Consider a rigid wire loop C moving in a magnetic field. What is the EMF £? 
Suppose the magnetic field B(x) is static in time but nonuniform in space. The 
magnetic force on a test charge q in the wire is q\ x B, where v is the macroscopic 
velocity of the point in the wire where q is located. The EMF around the loop is 
therefore, by the definition (10.1), 

£ = j> (v x B) • dl = <j) B ■ {dl x v), (10.3) 

using a vector product identity in the second equality. The subsequent analysis 
is based on Fig. 10.1, which shows the loop C at times t and t + St. (The limit 
<5r —x 0 is understood.) The surface swept out as C moves, denoted <5F, is shaped 
like a loop of ribbon. Note that dl x \ is dlx Sx/St, where <5x is the displacement 
of dt during St; from Fig. 10.1, di x Sx is the outward directed area element of a 
section of the ribbon. Thus the right-hand side of (10.3) is 8<t> r /8t, where 5<£ r is 
the flux of B through the ribbon surface 5F in the direction defined by dl x 8x. 
Now consider the closed surface composed of <5F and surfaces S(t) and S(t + 8t) 
bounded by C(t) and C(t + 8t) respectively: By Gauss’s theorem the flux of B 
outward through the closed surface is 0, so 

5cfi r + f B ■ dA — f B-dA = 0. (10.4) 

JS{t+dt ) JS(t) 

The sense of dA on S(t) is correlated with that of dl by the right-hand rule. With 
the fingers curling in the direction of dl the thumb points in the direction of dA. 
This direction is outward for the surface Sit + dt) but inward for S(t)\ hence the 
minus sign in the third term of (10.4). Finally, then, since <5<t> r = £8t by (10.3), 


£ 


d 

dt 


/ > 

JS(t) 


B dA 


d<t> 

" dt ’ 


(10.5) 


where $ is the magnetic flux through the surface S bounded by C. The dimensions 
of magnetic flux $ are T m 2 , or webers (Wb). Equation (10.5) is the formula for 



FIGURE 10.1 Motional EMF. The surface 8 E (a ribbon loop) is swept out as the circuit 
C moves between times t and t + 8t. Surfaces S(r) and S(t + St) are bounded by the curves 
C(t) and C(t + 8t), and the union of S(t ), S(t + 8t ) and <5Z is a closed surface. For the 
section of 5E shown shaded, the directed area element is dl x <5x. 
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FIGURE 10.2 Motional EMF. B points out of the page in the shaded region. As the 
conducting loop moves out of B there is a counterclockwise induced EMF equal to v Bw, 
which is —d<S>/dt where $ is the flux of B through the loop. 


motional EMF around a circuit moving in a magnetic field. The flux O(f) varies 
in time if the wire loop C(t) moves through a nonuniform field. 

We derived (10.5) in a general but abstract way. We’ll understand the result 
better if we verify it for a simple specific example. Figure 10.2 shows a rectan¬ 
gular loop of size L x w moving with velocity v i out of the region between the 
rectangular pole faces of a magnet. We may approximate the B field as uniform 
in the region between the pole faces, and zero outside. Figure 10.2 shows the 
configuration at t = 0. Assume the linear size of the magnet is larger than w. 
The motional EMF is £ = vBw by (10.3), because only the left end of the loop 
(which is in B) contributes to the line integral. As a function of time t, the length 
of the loop between the pole faces is L — vt, so the magnetic field flux through 
the loop is O = B(L — vt)w. The rate of change of O is d<t>/dt = —Bvw. Hence 
S = —d<t>/dt, in accord with the general result. 

Equation (10.5) was derived for motion of a rigid loop. But the same equation 
would hold if the conducting loop were to change in ways other than by transla¬ 
tion, such as expansion or contraction, or a change in shape. 

10.1.3 ■ The Faraday Disk Generator 

As another example of motional EMF, Fig. 10.3 shows a copper disk spinning in 
a uniform external magnetic field B = Bk. It is natural to describe the system in 
cylindrical coordinates (r, <p , z) with the z axis orthogonal to the disk. Fet a be the 
radius of the disk, and let oj = <uk be the angular velocity. There is a conducting 
wire, fixed in space, with sliding contacts at the rim and center of the disk. A 
current I will flow, in the sense shown, through the circuit composed of the disk 
and the wire, driven by the motional EMF. The EMF from the center to the rim is, 
by (10.3), 


£ = 


' 1 , 

corBdr = -coBa . 


( 10 . 6 ) 
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FIGURE 10.3 Faraday disk generator. A conducting disk rotates in a magnetic field. 
Because of the Lorentz force on mobile charges, a charge separation (+ at the rim, — at 
the center) would develop if there were no wire. The conducting wire connecting the rim 
and the center completes the circuit, and a current flows. 


By Ohm’s law the current is 


I = 


(t)Ba 2 
2 R ' 


(10.7) 


where R is the total resistance in the circuit. We can produce large currents with 
such a device, if the radius and angular velocity are large. The power dissipated 
by resistance must be replaced by the agent that keeps the disk spinning. If the 
disk is not driven then it will slow down and stop, a phenomenon called magnetic 
braking. The magnetic force on the current in an area element dA of the disk is 
dF = (K dA) x B, where K is the surface current density and dA = rdrdcp. 
K is essentially radial from the center to the rim. Thus the direction of dF is 
? x k = —<p, opposite to the velocity of dA, and so the external field acts as a 
brake. The torque on the disk is 


N = 



r x dF = — B 


/ 


r • K rdrdcp. 


( 10 . 8 ) 


(The double cross product has been reduced in the second equality.) The az¬ 
imuthal integral /q ' t K r rd<p is the total current I through a circle of radius r 
on the disk, and I is independent of r by charge conservation. The radial integral 
gives 


N = -flk 



Irdr = — 


IBa 2 - 

-k 


(10.9) 


The direction of N is opposite to the angular velocity &>k, so N is a braking torque. 

Faraday used this device as a very basic demonstration that when a conductor 
moves through magnetic lines of force, a current is induced. 


Self-Excited Dynamo 

We consider next an interesting modification of Fig. 10.3, in which the station¬ 
ary wire from the rim of the disk back to the axle is replaced by a stationary 
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solenoidal coil through which the induced current I flows. The coil is placed near 
the rotating disk, and has the same axis as the disk. Then the magnetic field due 
to I is orthogonal to the disk’s plane. The coil is wound in the sense such that its 
current I produces a magnetic self-field B se if that is in the same direction as the 
external field B. Then, once current starts to flow, the external field may be turned 
off, and if the generator is driven it will continue to produce an EMF. 

In principle, even if there is no external B, a generator driven at a large enough 
angular frequency co > o> cnt j Ca i, which depends on the geometry and the elec¬ 
trical resistivity of the system, will produce a B se if large enough to produce an 
EMF and current flow. Such a system may be called, even in the strictest sense, 
a self-excited dc-generator, or a dynamo. In practice, for a system of the kind 
just described, made only of copper, a> C riticai would be very large. If, however, the 
solenoidal coil is wound around a ferromagnetic core, then a self-excited gen¬ 
erator becomes practical. Generators in common use with self-generated fields 
have a ferromagnetic core that retains some magnetization if the system stops. 
Then when the system is restarted the magnetic field of the core serves the same 
purpose as an external field. 

To calculate o> cr iticai is difficult. However, dimensional analysis suggests that 
^critical is of order K \p/(i 2 [i o)], where AT is a dimensionless constant, p is the 
resistivity of copper, and € is a characteristic linear dimension of the system. For 
laboratory-size systems, t is the size of the apparatus, and tu C riticai is ver y large, 
practically unattainable. For the Earth, however, where t is much larger, o> cr iticai 
is small enough that Earth’s magnetic field is due to a self-excited dynamo. 

The terrestrial self-excited dynamo that is generally accepted to be the origin 
of the Earth’s magnetic field, is due to currents in the Earth’s outer core—a con¬ 
ducting liquid—which extends from about 0.2 /?Earth to 0.55 /?Eaith- This core is 
molten Fe, with a few percent Ni, at about 4000° C, far too hot for any ferro¬ 
magnetism. The precise details of the current distribution are unknown, although 
model calculations can be made. 2 

Even starting with a small applied external magnetic field, it is very difficult 
to construct a laboratory-scale fluid dynamo. The first successful demonstration, 
which used molten sodium as the conducting liquid and took 25 years to achieve, 
has been reported. 3 


10.2 ■ FARADAY'S LAW OF ELECTROMAGNETIC INDUCTION 

We have seen that a conducting loop moving in a static but nonuniform magnetic 
field experiences an EMF. For example, in Fig. 10.2 the nonuniformity is that B 
is confined to a finite region, out of which the conducting loop is being pulled. In 
such a situation, the conductor moves and the field is at rest. By the principle of 
relativity we should expect the converse phenomenon to occur: If the conductor 
is at rest, and the field moves, there should again be an induced EMF. After all, 

2 D. R. Stump and G. L. Pollack, Am. J. Phys. 66, 802-810 (1998). 

3 A. Gailitis, et al„ Phys. Rev. Lett. 84, 4365 (2000). 
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10.2.1 



FIGURE 10.4 Demonstration of electromagnetic induction. When the magnet moves 
with respect to the solenoid, the galvanometer needle deflects. 


the principle of relativity implies that one cannot determine an absolute state of 
motion. Who is to say which is at rest—the conductor or the magnet? Indeed a 
moving magnet does produce an EMF around a nearby circuit. An even stronger 
statement is true. If the magnetic field changes in time for any reason then there 
is an induced EMF from an electric field. This phenomenon is called electromag¬ 
netic induction. 

A simple demonstration of electromagnetic induction is illustrated in Fig. 10.4. 
A bar magnet is moved into or out of a coil of wire, which is connected to a 
galvanometer. When the magnet moves, a current flows in the solenoid, driven by 
the induced EMF, causing deflection of the galvanometer. This demonstration is 
not exactly the way Faraday first observed electromagnetic induction, but Faraday 
did similar demonstrations. 

Note the similarity and difference between electromagnetic induction and mo¬ 
tional EMF. We obtain a current either by moving the coil with the magnet at 
rest (motional EMF), in which case the force on the charges in the current is 
magnetic; or, we obtain a current by moving the magnet with the solenoid at rest 
(electromagnetic induction), in which case the force is electric. (In the latter case 
the force cannot be magnetic because the conductor is at rest; a magnetic force, 
which would be proportional to v x B, would be 0.) These are two aspects of 
the same phenomenon. In Chapter 12, on relativity, we will study in more de¬ 
tail the close relation between electric and magnetic fields in different frames of 
reference. 

Mathematical Statement 

A qualitative statement of Faraday’s Law is that a changing magnetic field creates 
an electric field. A quantitative statement, found in introductory textbooks, is the 
relation 


£ = 


d<P 

Hi ’ 


( 10 . 10 ) 


where £ is the EMF around a closed curve C due to the induced electric field, and 
<J> is the flux of magnetic field through any surface with boundary C. Equation 
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(10.10) is the same as the equation (10.5) for motional EMF, as we have antici¬ 
pated by considerations of relativity. The flux <t> can change for various reasons: 
The field may change with the loop fixed; the position, orientation, or shape of the 
loop may change with B fixed; or some combination of these changes may occur. 
In any case, (10.10) holds. 

Equation (10.10) is an integral relation between the line integral £ and the 
area integral <t>. An equivalent, but more powerful statement of Faraday’s law, is 
a local relation between partial derivatives of the fields E(x, t) and B(x, t ), which 
may now depend on time. We can derive the differential relation from (10.10) by 
Stokes’s theorem. The EMF around C due to the induced electric field is 

£=<f>E-d£ = J VxE-rfA. (10.11) 

Again, C is an arbitrary closed curve fixed in space, not necessarily occupied by a 
conductor. Faraday’s Law is really a statement about the fields, not about current, 
although it was first discovered by observing the current produced in a conductor 
by the induced E. The second equality in (10.11) is just Stokes’s theorem, and S 
may be any fixed surface bounded by C. In electrostatics £ would be 0, because 
an electrostatic field is irrotational, i.e., has curl equal to 0. But the electric field 
produced by electromagnetic induction is not irrotational. 

Now, the right-hand side of (10.10) is 

d<t> r 3B 

— =- d A. (10.12) 

dt J s dt 

We are justified in pulling the time derivative into the integral because the surface 
S is fixed in space. The surface integrals in (10.11) and (10.12) must be equal for 
any surface S, because the original loop C is arbitrary. The only way these two 
integrals can be equal for all surfaces is if the integrands are equal, 


V 


x E = - 


3B 

a7‘ 


(10.13) 


Equation (10.13) is Faraday’s Law. It is one of the universal equations of elec¬ 
tromagnetism. It relates the variation of E in space to the variation of B in time. 
Or, there is another way to regard the meaning of this equation. It says that there 
exists another source of electric field besides electric charge: A magnetic field 
changing in time is a source of electric field. This source is not like charge, from 
which E diverges, but rather like a “current” around which E curls. Of course 
it is not a charge current, but a magnetic-field current. Static charge creates an 
irrotational field. Nonzero 3B/3 1 creates a curling field. 

The Helmholtz theorem (Appendix B) states that a vector function is deter¬ 
mined by its divergence and curl with suitable boundary conditions. In the case 
of E the divergence is V • E = p/eo, which involves charges; the curl is V x E = 
—3B/3 1, which involves electromagnetic induction. 

The reader may feel that while (10.10) and (10.13) may seem plausible, based 
on the similarity to motional EMF and the relativity principle, the relation has 
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not been derived with logical necessity. But of course it cannot be derived from 
more fundamental principles, because no principle of electromagnetism is more 
fundamental than Faraday’s Law. Faraday’s Law, in either of the equivalent for¬ 
mulations (10.10) or (10.13), is an empirical observation, based on Faraday’s ex¬ 
periments, and is the most basic relation between time-dependent fields. 

10.2.2 ■ Lenz's Law 

Lenz’s law is a very useful statement about the direction of the induced E in 
electromagnetic induction. It is a corollary of Faraday’s Law. 

Faraday’s Law is a statement about the fields. The direction of the induced E 
follows from (10.13) or (10.10): If the thumb of your right hand points opposite 
to the direction of the change of B, then the fingers curl in the direction of E. 

Lenz’s law is a statement about the direction of the induced current if a con¬ 
ductor is present: If a conducting loop forms a closed curve C in a changing 
magnetic field, then the direction of the induced current opposes the change of B. 
Let’s check that this statement is right. Let the thumb of your right hand point 
opposite to the change of B. The fingers then curl in the direction of the current 
driven by the induced E; the magnetic field produced by that current is in the di¬ 
rection of your thumb (by Ampere’s Law) opposite to the change of B. Thus the 
induced current opposes the change of B. In other words, the current induced by 
a changing B field flows in the direction that tends to keep the flux from chang¬ 
ing. The principle is illustrated in Fig. 10.5. But note that the field of the induced 
current does not cancel the change of B; the flux is changing. 

Lenz’s law is helpful when we need to figure out the direction of the induced 
electric field. Of course we can always derive the direction by a very careful ap¬ 
plication of Stokes’s theorem, remembering that the directions of dA of S and di 


conducting loop 


induced current 


3B/3t 


B: of induced current 



FIGURE 10.5 Lenz’s law. C is a conducting loop and S is a surface bounded by C. 
As the magnet moves toward C, the flux through 5 increases, and the field at a point on 
S changes at the rate dB/dt. While the flux is changing the induced current opposes the 
change by making a field Bj opposite to 3B/3t. 
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of C are related by the right-hand rule. But it is usually easier to apply Lenz’s law, 
by imagining what would happen if a conductor were present. 


EXAMPLE 1 Suppose the circular pole faces of an electromagnet are above and 
below the xy plane, and parallel to the xy plane, so that the magnetic field is in 
the +k direction. (The north pole is below and the south pole above the plane.) 
What is the electric field in the xy plane if the magnitude of B is increasing in 
time? In particular, what is the direction of E? 

We can solve this problem by the “Amperian loop trick.” The problem has 
cylindrical symmetry. The direction of the electric field must be azimuthal 
(curling around — 3B/3f) and its magnitude cannot depend on 0; i.e., the field 
E(x, y, 0) has the form E(r)<j). Applying (10.10) to a circular loop of radius r, 
we find 


UD n 

2 TirE =- 7ir , (10.14) 

dt 

where B is the average of the field over the area of the circle. (The definition of 
the average field is B = (/ B • dA) /nr 1 -, if the field is uniform then B = B.) 
Thus the electric field is 

r dB « 

£ = (10.15) 

According to (10.15), E(x) is in the direction —<p if B z is increasing in time. This 
direction is in accord with Lenz’s law: The change of B is in the direction of +k. If 
C were occupied by a conductor, then current in the direction —(f) would produce 
a magnetic field in the —k direction by Ampere’s Law, i.e., opposing the change 
of B. 


10.2.3 ■ Eddy Currents 

Currents that are induced in conductors, especially conducting plates, are called 
eddy currents. Figure 10.6 illustrates the eddy currents associated with magnetic 
braking of a Cu plate swinging as the bob of a pendulum in a magnetic field, a 
common classroom demonstration. 


EXAMPLE 2 Magnetic braking. If there is no magnetic field, the plate, swing¬ 
ing as a pendulum from a suspension point high on the z axis, oscillates freely 
in the x = 0 plane. But if a magnetic field Bo is applied in the shaded region of 
Fig. 10.6, perpendicular to the plate, the motion is damped. Suppose Bo points 
in the +x direction, as shown. This may be arranged by letting the plate swing 
between the poles of an electromagnet, with the N-pole at x = — Axq and the 
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FIGURE 10.6 Eddy current demonstration. B points out of the page in the shaded 
region. As the Cu plate swings into or out of the magnetic field, the induced eddy currents 
oppose the change of flux. 


S-pole at x = +Axo. As the plate swings into the magnetic field region from the 
right, eddy currents are induced in the direction shown, clockwise by Lenz’s law. 
The induced magnetic moment m of the plate then points in the —x direction. 
At this stage of the motion the plate is repelled from the field region because of 
the force on the induced current exerted by the magnet. If the plate swings all the 
way through the field region, then counterclockwise eddy currents are induced as 
it leaves the region. At that stage the induced m is in the +x direction. The plate 
is attracted to the field region because of the force on the induced current exerted 
by the magnet. 

Both while entering and while leaving the field, the induced eddy currents 
experience a force braking the plate’s motion. This is the principle of magnetic 
brakes, which are used to stop circular motion of conducting disks, and are applied 
in, e.g., circular saws, lawn mowers, and high-speed trains. 

In magnetic-braking applications it is the conducting disk whose motion in 
a stationary magnetic field is damped. As we know, however, only the relative 
motion between conductor and field is relevant. Therefore, if one rotates a perma¬ 
nent magnet in front of a stationary conducting disk—we imagine a bar magnet 
rotating in a plane parallel to the disk—then the interaction of the bar magnet 
and the induced eddy currents will result in a drag torque that depends on the 
rotation frequency. This effect is applied in torque transmission equipment and in 
tachometers. 
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EXAMPLE 3 Asa quantitative example of magnetic braking due to eddy currents, 
consider a thin conducting rectangular frame released from rest that falls from a 
region of constant magnetic field normal to the frame into a region of zero field. 
We’ll show that its fall is held back by eddy currents. 

The frame and magnetic field are shown in Fig. 10.7. Call the long, vertical 
dimension of the frame i, and its width w. The horizontal line is the boundary of 
the magnetic field. As long as the entire frame is within the magnetic field, the 
flux through it is constant, there is no net EMF around it, no current flows, and 
there is no magnetic braking, so the frame behaves like any other falling body. 

But what is the subsequent motion if the frame is released from rest, at t = 
0, in the position shown in Fig. 10.7? Let downward be the positive direction. 
Counteracting the gravitational force Mg, there will be a magnetic braking force 
opposing the loss of flux through the frame as it moves downward out of the 
field region. If v is the downward velocity then there is a magnetic force on the 
electrons along the side ab, creating a current. According to (10.3) the EMF is 
£ — vBw. If we take B to point out of page then the induced current is I = 
vBw/R, where R is the resistance of the entire conducting path abcda, flowing 
in the counterclockwise direction around the frame. The magnetic force acting on 
the current in ab, the braking force, is upwards, and is given by 

B 2 w 2 v 

^braking — IwB = — , 

K 

negative implying upwards. Thus the equation of motion of the frame is 

dv B 2 w 2 

M—=Mg - v, (10.16) 

at R 

where M is its mass. To simplify the solution to (10.16) we use the notation 
gM R/(B 2 w 2 ) = Uoo and rewrite (10.16) as 



c 


d 


FIGURE 10.7 Falling frame. B points out of the page in the shaded region. The con¬ 
ducting frame (abed) falls in Earth’s gravity. The induced current and the braking force are 
calculated in Example 3. 
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dv 

dt 


gv 

— = g 

v oo 


(10.17) 


In this form we see that u<x, is the terminal velocity, i.e., the constant downward 
velocity the frame would reach if it could fall until it reached the steady state 
dv/dt = 0. 

Equation (10.17) is a first-order linear differential equation with constant coef¬ 
ficients. Its solution can therefore be written as a sum of a particular solution and 
a solution to the homogeneous equation: v(t) = v p + Ub = Vqc + C exp(— gt/voo), 
where C is determined from the initial condition u(0) =0. Finally then 


v(t) = v co (l-e s‘/voo) 


(10.18) 


v(t) starts from 0, and approaches Vqc exponentially. In a real experiment the 
frame might not reach terminal velocity. After it has fallen entirely out of the 
field, it will again fall freely. If a piece were cut out of the frame, i.e., if there 
were an insulating break in the path abcda, then there would be no current around 
the loop so the braking effect wouldn’t occur. 

In this example we used a rectangular frame because the eddy current for that 
case is easy to calculate. But it’s natural to ask also what would happen if instead 
of a frame the conductor were a thin sheet, as used in lecture demonstrations and 
described in Example 2. In that case there would also be induced eddy currents 
and a concomitant magnetic braking effect. However, the current flow is more 
complicated, and it can only be calculated by advanced analytical methods. 


EXAMPLE 4 Magnetic levitation. A potential future application of eddy cur¬ 
rents is in mass transport. The idea is to use the repulsion between induced cur¬ 
rents and magnets to levitate vehicles, with the resulting advantages of lower 
friction and less noise. Magnetic levitation has been used for trains in Germany, 
Japan, and England, but the cost is so high the future of this technology is in 
doubt. 

We already saw in connection with Fig. 10.6 that eddy currents produce repul¬ 
sive and drag forces when there is a relative sideways motion between magnets 
and conductors. Figure 10.8 will help to illustrate further some principles related 
to magnetic levitation. Imagine that the bar magnet in Fig. 10.8 is lowered from 
above into the position shown. If the plate is a good conductor then the eddy cur¬ 
rents induced in it will repel the magnet and tend to keep it suspended. If the 
magnet is now moved sideways, say to the right as indicated by the arrow, then 
new eddy currents produced at the front of the moving magnet will tend to keep 
it suspended. There will also be drag forces due to eddy currents in back of the 
magnet, so that a levitated vehicle, here represented by the magnet, must have a 
source of propulsion if it is to move. 

The development of high-temperature superconducting materials has increased 
the interest in magnetic levitation, because they may offer the possibility of pro- 
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FIGURE 10.8 Example 4. Magnetic levitation. 


ducing zero-resistivity materials with much lower refrigeration requirements 
compared to low-temperature superconductors. Interactions between supercon¬ 
ductors and magnetic fields are fascinating. Superconductors can act as perfect 
diamagnets, in the sense shown in Fig. 10.8, as long as the magnetic field is less 
than a critical value. In these materials, surface currents are induced that prevent 
the magnetic field from penetrating the bulk. In fact, if there is a magnetic field 
inside a metal in its normal state, then it will be expelled if the metal becomes 
superconducting as the temperature is lowered. This is a quantum-mechanical 
phenomenon called the Meissner-Ochsenfeld effect. This behavior does not occur 
in non-superconducting metals. In a non-superconductor, any magnetic field flux 
that has penetrated the bulk will remain in it as the temperature is lowered and 
the resistivity decreases. If one tries to remove the field from inside the metal 
at low temperature, say by turning off the source, then induced currents will 
flow according to Lenz’s law, in the sense so as to maintain the field inside. The 
study of magnetic properties of superconductors is closely bound up with their 
atomic structure and electric properties, and it is likely to remain an active field 
of research for a long time. 


10.3 ■ APPLICATIONS OF FARADAY'S LAW 

Faraday’s Law is not just an abstract field equation. It describes a physical 
phenomenon—electromagnetic induction—that is one of the most important in 
technology. Here we will look at three practical applications. 
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10.3.1 ■ The Electric Generator and Induction Motor 

Faraday’s Law is the principle at work in an electric generator. The essential de¬ 
sign is a conducting coil rotating in the magnetic field of a fixed magnet, as illus¬ 
trated in Fig. 10.9(a). For constant angular velocity the magnetic flux through the 
coil area A is 


O = J B ■dA = BA coscot (10.19) 

because the angle between B and dA is cot as the coil spins. Thus the induced 
EMF around the coil is, by (10.10), 

S = BAcosmcot. (10.20) 

Figure 10.9(a) illustrates a basic generator. The magnetic field is supplied by a 
stationary permanent magnet, the stator. The coil, or rotor, shown as a single loop 
of wire, is rotationally driven with angular velocity co by an external source of 
power. An alternating EMF (10.20) of angular frequency co is produced. The EMF 
is led to the outside via conductors on the rotation shaft, and eventually supplied 
to, say, an electric socket. In the position shown, the induced EMFs across the 
upper and lower segments of the coil drive currents in the indicated directions, 



FIGURE 10.9 Generators and motors, (a) Principle of the electric generator; (b) three- 
phase generator or induction motor. 
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in agreement with Lenz’s law. 4 In Fig. 10.9(a), when the plane of the loop is 
horizontal (i.e., perpendicular to B) the induced EMF is zero, because at that 
instant 34>/3 t = 0. When the coil is vertical (i.e., parallel to B) the induced 
EMF is maximum. After the plane of the coil has turned through 180 degrees, the 
current direction in each segment is reversed. All of this behavior follows from 
(10.13) and Lenz’s law. The magnetic moment m induced in the loop is in the 
direction such that the torque on the loop N = m x B is in the direction opposite 
to uj. (In Fig. 10.9(a), w is into the page and m x B is out of the page.) Power 
—N • cj is required to drive the generator at constant w; that power is expended as 
7 2 R and mechanical losses in the load connected across the electric socket. 

Motors and Generators: A Symmetry. Figure 10.9(a) can also be used to il¬ 
lustrate a basic ac-electric motor. There is a general equivalence—a symmetry— 
between motors and generators, related to conservation of energy. In running the 
system as an electric motor, alternating current at frequency co is supplied to the 
coil from outside via conductors along the rotation shaft When the motor coil is 
in the position shown in Fig. 10.9(a) the current flows opposite to the directions 
shown for the generator mode, so that the torque is now parallel to cj and the 
coil is driven in the direction shown. When the coil passes through horizontal, the 
direction of the supplied current must reverse for the torque to remain parallel to 
cj. In the motor, power P = IV is supplied as current / at voltage V ; that power 
is expended as mechanical work done by the motor. In general only small motors 
use permanent magnets as the source of the magnetic field. 

Induction Motors. Most large electric motors are induction motors, in which 
the source of the magnetic field is alternating current in a stator. This current is 
the means by which the external power to run the motor is supplied. The stator 
consists of several stationary electromagnets whose poles are arrayed at axially 
symmetric positions around the rotation axis of the motor. The spatially and tem¬ 
porally varying magnetic field of the electromagnets—a rotating field —induces 
time-dependent EMFs in the coils of the rotor in the sense to drive the rotor at 
a frequency equal to an integer multiple of the frequency of the supplied current 
in the electromagnets. Figure 10.9(b) shows a schematic diagram of a three-pole 
induction motor. There are iron cores in the electromagnets, which increase the 
magnetic field by a factor of order 10 4 compared to what copper coils alone would 
provide. The rotor is made up of many coils of wire. The current in the rotor coils 
is an induced current; there is no contact between the coil wire and the outside! 
In a real motor the gap between the outer surface of the rotor and the stator poles 
would be small and the number of poles would range from 2 to 100. 

Most industrial distribution systems supply electricity to run induction motors 
in the form of three-phase power. To obtain three-phase power one uses, for ex¬ 
ample, a generator like that in Fig. 10.9(a), but with three loops, or three coils, 

4 Lenz's law gives the correct current direction in the loop; however, it is easier in this case to figure 
out the current direction from the direction of the magnetic force q\ x B. 
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each with its own feedout, making angles of 120 degrees with each other, on the 
same rotation shaft. The output of such a generator is the superposition of three 
EMFs with the harmonic form of (10.20); but each is shifted in phase by one-third 
of a cycle, 120 degrees, with respect to the others. This output can be applied to 
energize the three electromagnets of Fig. 10.9(b) so that the magnetic field rotates 
and the coil rotates with it. The principles of polyphase ac-induction motors and 
rotating magnetic fields were invented by Nikola Tesla in the 1880s and first ap¬ 
plied commercially by the Westinghouse Electric Company. Common motors of 
this kind range from very small ones in servomechanisms, whose power is a frac¬ 
tion of a watt (10 4 hp), to large motors used in pumps and wind tunnels, whose 
power is several thousand hp. 

10.3.2 ■ The Betatron 

A betatron is an electron accelerator that uses the electric field induced by a 
changing magnetic field to accelerate electrons to relativistic energies. (In nu¬ 
clear physics an electron is called a beta particle, because electrons are products 
of some beta decays.) The first betatron was built by Kerst in 1940, for electron 
scattering experiments in nuclear physics. Betatrons are still used today to pro¬ 
duce hard X rays for studies of solid-state structure, and for medical purposes. 

A schematic diagram of a betatron is shown in Fig. 10.10. Electrons circulate 
in a toroidal vacuum chamber of radius ro between the poles of an electromagnet. 
The magnetic field in the plane of the torus is B(r, r)k. What is the condition 
on B(r, t) such that the radius ro of the electron orbit remains constant as the 
electrons gain kinetic energy? 



FIGURE 10.10 Betatron. The electrons travel in the toroidal vacuum chamber at radius 
ro- The magnetic field strength increases from 0 to a large value B max during the acceler¬ 
ation phase. As B z increases an azimuthal induced electric field raises the electron kinetic 
energy. The pole shape is designed such that the radius of the electron orbit remains con¬ 
stant as the electrons gain kinetic energy. The betatron has been likened to a slingshot: B 
ramps up to B max and then the electrons are released to the target. 
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For circular motion with radius r o the momentum is p(?) = p(t)<p, where 
the magnitude of the momentum is p(t) = eroB(ro, t), just as in a cyclotron 
discussed in Chapter 8. But in the betatron, the magnitude of the momentum in¬ 
creases because of the force exerted by the induced electric field. By Newton’s 
second law and (10.15), 


dp 

dt 


= -eEd, = — 


era dB 


2 dt ' 


( 10 . 21 ) 


where B is the average of B ? over the area inside the orbit. Comparing the two 
relations between p and B shows that the condition for ro to be constant is that 

B(r 0 ,t)= \B{t). (10.22) 

That is, the pole shape must be designed in such a way that the vacuum ring is at 
the radius where the field strength is B/ 2. 

A betatron can be used to accelerate electrons up to energies of a few hundred 
MeV. For higher energies it is difficult to get the large average magnetic fields that 
are required; also, high-energy electrons radiate a lot of electromagnetic energy, 
so that (10.21) no longer holds. Therefore to reach higher energies, synchrotrons 
or linear accelerators are necessary. 


10.3.3 ■ Self-Inductance 

An inductor is a circuit device that stores energy in a magnetic field. In this section 
we will consider self-inductance of a conducting loop. 

If a conducting loop carries current I then the magnetic flux d> through the 
area bounded by the loop is, by the Biot-Savart law, proportional to I : 

$ = LI. (10.23) 

(This assumes any magnetic materials in the system are linear.) The constant 
of proportionality L, called the self-inductance, depends on the geometry (size 
and shape) of the loop, but not on the current. The SI unit of inductance is the 
henry (H), which is equal to T m 2 /A, or Wb/A. 5 

If / changes then there is an induced EMF £ around the loop by Faraday’s 
Law, 


„ d<S> dl 
£ = -— = -L — . 
dt dt 


(10.24) 


This EMF is called a back EMF because by Lenz’s law it opposes any change 
of/. 


5 Wb, for weber, i s a unit of magnetic flux, equal to T m 2 . 
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EXAMPLE 5 What is the self-inductance of a long densely wound cylindrical 
solenoid? 

Suppose the solenoid has N turns of wire, length £, and cross section A. From 
magnetostatics the magnetic field in the solenoid is B = poN I/l in the direction 
parallel to the axis. The magnetic flux through the area enclosed by one loop of 
wire is BA, and the total flux, summed over all loops of the solenoid, is <J> = 
BAN. Therefore the self-inductance is 


<J> p Q N 2 A 

7 ~ e 


(10.25) 


Equation (10.25) is the inductance for a cylindrical solenoid with any cross sec¬ 
tional shape—it need not be circular. The self-inductance per unit length L/i of 
a long solenoid, denoted L', is pon 1 A, where n = N jl is the number of turns per 
unit length. 


EXAMPLE 6 What is the self-inductance of a closely wound toroidal coil with 
rectangular cross section? 

Figure 10.11 shows a top view and a cutoff view of the toroidal coil. Each 
loop has height b, and extends from the inner radius R to the outer radius R + a. 




FIGURE 10.11 Example 6. The toroidal solenoid shown in a top view and a cut-away 
side view. The dashed circle is an Amperian loop used in the analysis. 
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For convenience, take the xy plane to be the median plane of the toroid, pass¬ 
ing horizontally through the centers of the rectangular windings. The z axis goes 
vertically through the center of the torus. N is the total number of windings. 

Recall that in Example 6 of Chapter 8 we learned that the magnetic field inside 
a long straight solenoid is uniform B = /xonlk , and outside it B = 0, for any cross 
sectional shape as long as it is constant along the entire length of the solenoid. A 
similar symmetry, which is, however, more difficult to analyze, holds for a toroidal 
solenoid. 

For a toroidal solenoid, symmetry requires that inside the solenoid the mag¬ 
netic field have the form B = B^(r)(p, and outside it B = 0, for any cross sec¬ 
tional shape as long as it is the same all the way around the solenoid. In this case 
we can obtain the field by applying Ampere’s Law to the circular path of radius r 
shown in Fig. 10.11(a). The result is 

(£ B • d£= B^(r)2nr = MoN/, (10.26) 


from which 


B<p(r) = 


fioNI 

2nr 


(10.27) 


(Note that the current enclosed by the circular path is the same for any height z 
inside the toroid, so B$ is independent of z.) Then the total flux through all N 
rectangular loops of wire is 


<t> = N 


I B*(r) 


dA 


2tt 


mo n 2 i r dA _ moN 2 / r R+a 
Jr 2 jt Jr 


b dr 


Finally, the self-inductance is 


4> Mo N 2 b (R +a 


I 


2tt 


In 


R 


(10.28) 


(10.29) 


EXAMPLE 7 An important application of self-inductance is in resonant oscillator 
circuits. How does the response of a harmonically driven LC circuit, shown in 
Figure 10.12, depend on the driving frequency a>? Realistically we should also 
include resistance, but to understand the principle we may neglect resistance. The 
potential difference across the capacitor is 

Q dl 

— =Socoscot — L —, (10.30) 

C dt 

where Q is the capacitor charge and I = dQ/dt the current. In (10.30) the term 
So cos cot is the alternating EMF of the power supply, and —Ldl/dt is the back 
EMF of the inductor. The equation for the time dependence of Q is 
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FIGURE 10.12 Example 7. A harmonically driven LC circuit is an example of reso¬ 
nance. The applied EMF is S(t) = Sq cos cot. 


d 2 Q ^ 2n So 

~J~2 + W 0 Q = J COS (Ot, 

where the resonant frequency wo is 

wo = 1 / V LC. 


(10.31) 


(10.32) 


The steady-state solution of this inhomogeneous linear differential equation has 
<2(0 oscillating at the driving frequency w, i.e., <2(0 = <2o cos cot. Substituting 
this form into (10.31) we find that the steady-state solution is 


< 2(0 = 


£q cos cot 
L co q — w 2 


(10.33) 


For w < wo, <2 is in phase with £\ for w > wo, it is 180 degrees out of phase. The 
amplitude of oscillation diverges as the driving frequency approaches the “natural 
frequency” wo. (Resistance, which we have neglected, would make the amplitude 
finite and introduce a phase shift.) An oscillator circuit has maximum energy when 
L and C are tuned such that w = wo, an example of resonance. Resonant LC 
circuits are used in radio tuners. A variable capacitor makes it possible to tune to 
a continuous range of frequencies. 


10.3.4 ■ Classical Model of Diamagnetism 

In Sec. 9.1 we studied a classical model of the response of an electron in an atom 
to an applied field B. The result (9.5) is an equation for the resulting change of the 
orbital magnetic moment, which, because it is opposite in direction to B, provides 
a model of diamagnetism. The crucial step in the calculation was to state that the 
orbital radius remains constant as the magnetic field increases from 0 to B. That 
step is justified by Faraday’s Law. 

Refer to Fig. 9.1. For B = 0 the electron (—e) revolves around the nucleus (Ze) 
in a circular orbit satisfying mv^/r = Ze 2 /(4jrfor 2 ). If the field is B k then the 
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equation of motion for the same radius r is 


2 

mv 

r 


= evB + 


Ze 2 
47T€or 2 


The difference between the speeds Sv = v — vq for the two cases, is small; ne¬ 
glecting terms of order (<5u ) 2 we may write 

erB 

5u = —. (10.34) 

2m 

If the speed changes in this way as B is applied, then the orbit remains a circle 
with constant radius r. 

The cause of the increase of the electron speed is not the magnetic force, which 
does no work, but the electric force exerted by the induced electric field of Fara¬ 
day’s Law. We’ll now calculate this increased v and show that it’s the same as re¬ 
quired by (10.34). The kinetic energy changes by —e£ for each revolution, where 
the EMF around the orbit is 


£ 


d<P 

17 


= — nr 


dB 

17' 


That is, the rate of change of kinetic energy is 



where T = Inr/v is the period of revolution. Substituting Faraday’s Law for £ 
we find 


dv er dB 
dt 2m dt 


(10.35) 


This equation for the change of the electron speed is identical to the condition 
(10.34) for the radius to be constant. 

We shouldn’t read too much into this model, because electrons do not obey 
classical mechanics in atoms. But it does give a simple explanation, and about the 
right magnitude, for the diamagnetic response of an atom. 


10.4 ■ MUTUAL INDUCTANCE 

We have earlier (Sec. 10.3.3) considered self-inductance of a single current 
loop. In this section we consider a general system of interacting current loops 
Ci, C 2 , C 3 ,_For steady currents I\, I 2 , h, ■ ■ ■ the magnetic field is the su¬ 

perposition of the fields from the individual loops, and is linear in each of the 
currents . 6 The magnetic flux <t>, through loop C, is a linear combination of the 

6 For simplicity we ignore magnetic susceptibility of any materials in the system. 
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currents 


<t>; =L l I i +Y J M l] Ij. (10.36) 

i¥=i 

Here L, is the self-inductance of C,, and Mjj is the mutual inductance of C, and 
Cj . The inductances depend on the geometry of the system, but not on the cur¬ 
rents. The unit of mutual inductance is the henry (H), the same as self-inductance. 

We define Mjj = <f >, 7 / Ij , where 0, y is the flux through loop C, of the magnetic 
field produced by Ij . Similarly, Mjj = 0 ji/U■ The simplest case is two loops. In 
that case M^\ = 02 i//i is the flux through loop 2 due to unit current in loop 1 ; 
Af 12 = 0 12/^2 is the flux through loop 1 due to unit current in loop 2. We shall 
prove that these mutual inductances are in fact equal. In general, Mjj = Mjj. 

Let B j denote the field produced by Ij, and A j the corresponding vector po¬ 
tential, so that Bj = V x Aj. The flux 0, y is 

Mjjlj = I B 7 ■ dSj = (f) Aj (x,) ■ d£j , (10.37) 

JSj JCj 

where Sj is a surface bounded by C,; the second equality is by Stokes’s theorem? 
In Chapter 8 we obtained a general formula (8.54) for the vector potential, which 
is for this case 


* , N do i Ijdlj 
A j (x;) = — <f> 

4tt Jcj |x* - x 7 | 


(10.38) 


Substituting this result into (10.37), and canceling the common factor Ij, we find 
an equation for the mutual inductance 


Mjj = 



d£j ■ d£j 

I* -*;l 


(10.39) 


a result known as Neumann’s equation? This result shows explicitly that Mjj is 
entirely determined by the geometry of the system. Also, the double integral is 
unchanged if we interchange the subscripts i and j, proving that Mjj = Mjj. For 
the simplest case—two loops—this becomes M \2 = M^\. That means that for 
any two loops, the flux through loop 1 due to unit current in loop 2 is the same as 
the flux through loop 2 due to unit current in loop 1 , a remarkable result! 

There is no expression analogous to (10.39) for self-inductance. In fact, (10.39) 
is an approximation: We have treated the currents as one-dimensional curves, 
which is a good approximation if the wire diameter is small compared to all other 
distances. But this approximation would break down for self-inductance. In fact, 

7 Be sure to understand the notation in (10.37): dSj is the area element of the surface S,-; A j is the 
vector potential of the field. We usually use d A for the area element, to emphasize that it has units of 
area, but here the symbol A is reserved for the vector potential! 

8 The integral in (10.39) could have either sign, depending on the relative directions of the two loops. 
It is conventional to make Mjj positive. 



378 


Chapter 10 Electromagnetic Induction 


we shall see in Example 12 that the self-inductance of a current loop diverges as 
the wire diameter approaches 0. 

The general expression (10.39) looks pretty unwieldy (a double line integral) 
but it can be evaluated analytically in simple cases. More importantly the result 
Mij = Mji is handy for finding mutual inductances, because one of Mjj or M jj 
might be easier to figure out than the other, as is the case in the following exam¬ 
ples. 


EXAMPLE 8 Find the mutual inductance for the circuits 1 and 2 shown in Fig. 
10.13. Ci is a rectangle in the xy plane, of size b x c. C 2 is a larger rectangle in 
the xy plane, infinite in the y direction, and at distance a from the parallel sides 
of Ci in the x direction. It is simple to calculate $ 12 , because B 2 is just the field 
due to a pair of long wires. The mutual inductance is 


A/12 = 


■ = L f 

h Js 

_ j_ r b /2 r 

h J-b/2 L 


B2 • d Ai 


mh 


+ 


mh 


2n(a + b/2 — x) 2n(a + b/2 + x) J 


cdx 


me 

71 


In 




(10.40) 


The first term of the integrand is the flux through C\ of the magnetic field due to 
the current in the wire on the right of C 2 , and the second term is the equal flux 


y 



FIGURE 10.13 Example 8. Geometry of two rectangular loops for which the mutual 
inductance is to be calculated. 
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through Ci due to the wire on the left of C 2 . It would be harder to calculate the 
other flux $ 21 , because Bi—the field of a rectangular loop—is not so simple. But 
that calculation is unnecessary because we know that A /21 = A/ 12 . 


EXAMPLE 9 Find the mutual inductance for the circuits 1 and 2 shown in Fig. 
10.14. Imagine that Ci is a very long, tightly wound, solenoid, with N] turns and 
length £[. C 2 is a loosely wound solenoid, with Nn turns and length € 2 - Both have 
cross section S. 



FIGURE 10.14 Example 9. Geometry of two concentric coils for which the mutual 
inductance is to be calculated. 


In this example it is easy to calculate <J> 2 i, because Bi is just the field of a 
tightly wound solenoid (noN\I\/i\) and <t> 2 i = N 2 B 1 S. Thus the mutual induc¬ 
tance is 


M21 


<t>2\ MO-Wi 

~h = h 


(10.41) 


The result is not invariant under the interchange of 1 and 2. The inductance M 12 is 
not obtained by interchanging 1 and 2 in the result, because the two solenoids are 
different; rather, A /12 h as the same value as A /21 by the general theorem (10.39). 


Mutual inductance is the phenomenon at work in a transformer. The EMF 
around a secondary coil C 2 is related to the changing current in a primary coil Ci 
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_ d ( t>2 dl i 

£ 2 = —— L = -M 2 1-i. (10.42) 

dt dt 

An AC transformer may increase (step up) or decrease (step down) the voltage 
from the primary to the secondary, depending on the mutual inductance. For ex¬ 
ample, the ignition coil in a car takes input voltage alternating between 0 and 12 
V—alternating because of the opening and closing of contact points—and steps it 
up to a high output voltage. Then the electric field at the spark plug exceeds the 
breakdown field for the air-fuel mixture in the cylinder, a spark is produced, and 
the mixture ignites. 

The development of transformers was an important step in the history of AC 
electric power, because long-distance transmission lines use a much higher volt¬ 
age than household appliances or electric lights. Throughout cities and towns 
transformers which step down the voltage for household use, can be seen on tele¬ 
phone poles. 

There is a subtle approximation in (10.42), or indeed in any circuit calculation 
involving inductors. We use magnetostatics to evaluate M 2 \, but in (10.42) the 
magnetic field is not static! This is an example of the quasistatic approximation: 
If the current changes slowly, then the flux calculated from magnetostatics is a 
good enough approximation. For very high-frequency alternating currents there 
would be radiation fields, so the quasistatic approximation would break down. But 
in common applications of electrical engineering, the quasistatic approximation 
is very accurate. 

In Section 8.7 we found the vector potential of acurrent loop; the result (8.106) 
has the form A(x) = A,p(r, 0)0. We now apply that result to a sophisticated 
example. 


EXAMPLE 10 Find the mutual inductance between two coaxial current loops. 
The arrangement is shown in Fig. 10.15. 


z 



FIGURE 10.15 Example 10. Geometry of two coaxial circular loops for which the mu¬ 
tual inductance is to be calculated. 
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To do this problem starting from first principles would be difficult, but by hav¬ 
ing (8.106) and (10.37) we’ve already prepared the way. Notice that to use (10.37) 
to find Mij, we need Ay(x, ), which is the vector potential at the rth loop due to 
the current in the jth loop. In our case the vector potential at Loop 2 due to the 
current in Loop 1 is constant because the spherica l coordinates of points on Loop 
2 are r = s/b 2 + c 2 and 0 = arcsin(fc/V b 2 4- c 2 ), which are both constant. Thus 
(10.37) when applied to our problem becomes, since Ai (X 2 ) = A$(r, 


p2n 

J 0o =0 


lb 1 + c 2 , arcsin(Z?/> 


(pbdip 2- (10.43) 


Substituting the vector potential of Loop 1 from (8.106) into (10.43), and evalu¬ 
ating it with the values of r and 6 at Loop 2 gives, after simplification 


M(far) = M 2 i 



sin <p' dcp'. 


(10.44) 


for the mutual inductance of the loops. (Recall that P n is the Legendre polynomial 
of degree n.) _ 

It is important to note that (10.44) applies only when ~Jb 2 + c 2 > R, the 
condition for which (8.106) was derived; that is the significance of the notation 
M(far). Although (10.44) is an infinite series, containing only terms for odd n as 
discussed in Section 8.7, it converges rapidly if R/y/b 2 + c 2 is small compared 
to unity, i.e., if Loop 2 is far away compared to R. The series starts off with the 
leading term (n = 1) as M = n,oR 2 b 2 7t/2{b 2 + c 2 ) 3 ^ 2 + • • •; the higher terms 
can be evaluated numerically. 

If the two loops are near enough together so that V b 2 + c 2 < R, then the 
mutual inductance becomes 


M(near) = M 2 \ 




sirup' d(f>'. 


(10.45) 


The mutual inductance for the interesting basic problem of two coaxial and copla- 
nar loops is obtained by setting c = 0 in (10.45). 
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10.5 ■ MAGNETIC FIELD ENERGY 

By analyzing the energy balance for an isolated inductor, we can deduce the for¬ 
mula for the energy content of a magnetic field. The power that must be supplied 
to maintain a current / in an inductor while the current is changing, in the pres¬ 
ence of the back EMF £ = — L dl/dt, is 

P = —IS = +/L —. (10.46) 

dt 

Thus the total energy that must be supplied to establish a current /, starting from 
0, is f Pdt = f LIdl = \L1 2 . By conservation of energy, that amount of energy 
is stored in the inductor 


U = -LI 2 . (10.47) 

2 

We attribute the stored energy to the magnetic field B produced by the current /. 
Therefore, we must be able to rewrite U directly in terms of B. 

As a first step toward relating U and B, we express U in terms of the current 
and vector potential A, where B = V x A. The quantity LI is, by definition, the 
flux of B through a surface S bounded by the current loop C, so 

LI = j B dS = j> A di, (10.48) 

the second equality by Stokes’s theorem. Thus the magnetic energy can be writ¬ 
ten as 


U M = ^£lA-di. (10.49) 

Although we have derived (10.49) for a single current loop, the same formula 
holds for multiple loops if we sum the right-hand side over all the loops. 9 Also, 
we may generalize the result to a continuous distribution of current, with volume 
current density J(x), by replacing Idi by J d 3 x, because idA is the current dl in 
a tube of area dA. Thus, in general, 

U M = ^jj-Ad 3 x. (10.50) 

Note that (10.50) is analogous to the equation for electrostatic energy Ue = 
5 f pVd 3 x that we derived in Chapter 3; in both cases, one-half of source density 
times potential is energy density. 

Finally, we may write Um entirely in terms of the magnetic field. For magne¬ 
tostatics, Ampere’s Law is that J = V x H. The integrand in (10.50) is 

(V x H) • A = V • (H x A) +H- (V x A), (10.51) 

9 See Exercise 26. 



10.5 Magnetic Field Energy 


383 


where the right-hand side follows from a vector identity. The volume integral of 
V-(HxA) is 0 for a localized field, i.e., a field that approaches 0 at oo, by Gauss’s 
theorem: It equals the flux of H x A through the surface at oo, where the field is 
0. The final term in (10.51) is H • B. The total energy is 

U M = ^ J H-Bd 3 x. (10.52) 

Attributing this energy to the magnetic field, we deduce that the local energy 
density is 


mm(x) = jH(x) ■ B(x). (10.53) 

In the absence of magnetic materials, H is B/no- Then the magnetic energy 
density, associated only with the field B, is um = Recall for comparison 

that the energy density of the electric field is me = \eoE 2 in the absence of 
dielectric materials. In the presence of matter, the electric energy density is • E 
and the magnetic energy density is j H • B; these energies include both field energy 
and polarization energy of the matter. 

The reader has the right to feel dissatisfied with the above derivation of the im¬ 
portant fundamental result (10.53), which was only heuristic and had some gaps. 
A rigorous derivation will be given in Chapter 11, based on local conservation of 
energy. 


EXAMPLE 11 What is the magnetic energy per unit length for a long cylindri¬ 
cal solenoid, tightly wound with n turns per unit length of wire carrying current 
/? This is a good example, because it is the simplest form of magnetic field—a 
uniform field inside the cylinder. 

Figure 10.16 shows two views of the solenoid. We will use three methods to 
find its energy per unit length U' M with units J/m. 


Method 1. We first find \J' M by using (10.53), which says that the energy density 
is um = B 2 / 2/uo. Inside the solenoid B = /io«/k, and outside B = 0. Therefore 
the energy density inside (r < R) is u m = dan 2 1 2 /2, and outside (r > R) it is 
zero. The volume enclosed by a length £ of the solenoid is n R 2 £. Finally then 




UMttR 2 l 


n 2 i 2 R 2 - 


(10.54) 


Method 2. We next find \J' M by using (10.47), in the form U' = L'l 2 / 2. In 
(10.25) we found the total inductance L of a solenoid of length £ with N turns. 
Dividing by £ gives the inductance per unit length L' = L/l = /ion 2 A, where 
n = N/£ and the area of the cylinder is A = n R 2 for our case. If we substitute 
(10.25) into (10.47) the result for U' M is the same as (10.54), as we expected. 
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B = 0 

tums/length = n 


z axis 


(a) 


B = 0 



FIGURE 10.16 Example 11. The magnetic field energy in a solenoid. The view in (b) 
is from the +z axis looking in the direction of —k. 


Method 3. In the two previous calculations the magnetic energy was taken to 
be inside the cylindrical volume of the solenoid—that assumption was explicit in 
Method 1 and implicit in Method 2. However, we know from (10.49) and (10.50) 
that Um may be associated with the current itself. It will therefore be interesting 
to find U' M from that principle. We proceed by writing (10.50) in the equivalent 
form, appropriate for surface currents, 

U M = ^j>K-\dS. (10.55) 

In applying (10.55) to our problem we have, for the current per unit length on the 
solenoid’s surface, K = n/0 as shown in Fig. 10.16(b). Also, the surface area of 
a length l of the solenoid is S = 2rtRl. Now we need A(x). 

A Nice Amperian Trick. To evaluate (10.55) requires the vector potential A(x) 
for a solenoid. We’ll determine this interesting quantity by an Amperian loop 
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trick! By Stokes’s theorem, the flux of B through a cross section of the solenoid 
is 


<$> = j)A-di, (10.56) 

where C is the boundary curve. Equation (10.56) is reminiscent of Ampere’s Law. 
If there’s enough symmetry, we can use (10.56) to find A if <£ is known, just as if 
there’s enough symmetry we can use Ampere’s Law to find B if I is known. Figure 
10.16(b) shows an end view of the solenoid with two closed curves, Cj and C 2 , 
along which we can apply (10.56). The current around the solenoid is constant 
and flows in the <j) direction, so from symmetry we take the form A(x) = A,p(r)<j) 
along both C\ and C 2 . Now for C 2 the enclosed flux is within R, so that <t >2 = 
B z nR 2 = hohIttR 2 . For C\ the enclosed flux is <J>i = B z nr 2 = [i^nlnr 2 . In 
both cases the right-hand side of (10.56) is A^(r)2nr. Therefore 

A ext = A 0 (r)0 = — nIR - if) for r > R, (10.57) 

2 r 

and 

Aim = A^(r)0 = —</> for r < R. (10.58) 

Notice that A is continuous at r = R and has there the value A(R) = (jxonl R/2)cj). 

Now we return to the calculation of the magnetic energy (10.55). Substituting 
K, A and the surface area S, we find 

, Um 1 - /1 \ 2 2:tRI 1 ??? 

U' M = -j- = -(«/)</> • ( 2 ^ 0 nIRj cf> = -norrn 2 I 2 R 2 , (10.59) 

in agreement with the previous methods. 

The vector potentials in the exterior and interior regions (10.57) and (10.58) 
are related to the respective magnetic fields by B = V x A. It’s interesting to 
note that although B = 0 outside the solenoid, A is not zero there and does 
have physical significance. For example, if the solenoid’s current were to vary 
with time, A would also become a function of time, of the form A = A,p(r, 
Whenever A changes with time there is an associated electric field E = —3A/3 1, 
because Faraday’s law written in terms ofAisV xE = Vx (—3A/3?). 10 For a 
solenoid with a varying current there is an azimuthal electric field and so nonzero 
EMF around C\ and C 2 - 


The fact that the energy in an inductor is \Ll 2 provides another basis for cal¬ 
culating the self-inductance L of an inductor: Calculate the field energy (10.52), 

10 We’ll leam more about the connection between E. A, and the scalar potential V in Chapter 11. 
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and set it equal to \LI 2 . This technique is sometimes easier than using the def¬ 
inition L = 4>//, because calculating 4> can be difficult if it involves self-flux 
through wires of finite size. The next example illustrates the method. 


EXAMPLE 12 Find the self-inductance per unit length of a long coaxial cable for 
which the inner conductor is a solid cylinder of radius a and the outer conductor 
has radius b. Let £ be the length of the cable. Assume that the current in the inner 
cable is uniformly distributed over the cross section. 

Current / flows in the inner conductor, and —I in the outer conductor. By 
Ampere’s Law the magnetic field is B(r)<j), and 


B(r) = 


IM)Ir/(2 tt a 2 ) for r < a 
HqI/( 2nr) for a < r < b 

0 for r > b. 


Therefore, using (10.52), the total energy is 



(10.60) 


(10.61) 


The integrals over r are elementary, and it is straightforward to show that the 
self-inductance per unit length is 


L r _ L _ W ]n b uo 
£ 2n a 


(10.62) 


Note that the self-inductance of the cable diverges logarithmically in the limit 
a —> 0. If the inner conductor were a thin cylindrical sheet of radius a, then L' 
would be just the first term on the right of (10.62). The second term, which arises 
from the field energy inside the inner conductor, is the internal self-inductance per 
unit length of a wire of finite radius. 11 

The case of an isolated wire is similar. For a wire of finite radius a the internal 
self-inductance per unit length is again no/8n. However, in the limit a 0 
the external magnetic field energy density near the wire, which is B 2 /2/io = 
/io/ 2 /(87r 2 r 2 ), goes to infinity as r —> 0 so fast that the external contribution to 
the inductance becomes infinite. 12 


10.5.1 ■ Energy in a Ferromagnet 

Equation (10.52) for the magnetic energy is correct if the only magnetic materials 
present are linear materials—diamagnetic or paramagnetic. Its derivation relies 

"Smythe, Sec. 8.09. 

,2 Recall (7.23) that the capacitance per unit length of a wire with length L and radius a is C' = 
27re 0 / In (L/a). 



10.5 Magnetic Field Energy 


387 


on the assumption that B is proportional to I. In the presence of ferromagnetic 
materials, that assumption is not true. I is the free current, and although H is 
proportional to /, B is not. In ferromagnetic systems, the definition of magnetic 
energy is problematic, because of the phenomenon of hysteresis. The energy re¬ 
quired to prepare the system in a specified state depends on the path by which the 
state is created. 

To analyze the energy for a system with a ferromagnet, consider a current- 
carrying coil of N turns, wound around a solid ferromagnetic ring, as shown in 
Fig. 10.17. If the flux pertum changes by 5$ in time St, then the EMF induced in 
each tum is £ = —8<&/8t. The work that must be done against this back EMF to 
maintain the current I in all N turns of wire is 5IV = — £NI St = N18<t>. Now, 
NI = § C H • d£ by Ampere’s Law, where C is an Amperian loop linked with the 
coil (i.e., C is a circle inside the iron ring). Thus the work is 

5W = (j) <5<t> H - dt. (10.63) 

For a process in which the magnetic field changes by <5B the change in flux is 
8<t> = SSB, where S is the area enclosed by one tum of the coil. Inserting this 
result in (10.63) we encounter Sdl, which is a volume element d 3 x in the field. 
That is, the work done to change the magnetic field by <5B is 

8W = J SB Ud 3 x. (10.64) 

For a linear material (B = //.H), 8W is equal to the change of magnetic 
energy SUm, with Um given by (10.52). In that case the total work done as 
the system undergoes a cyclic process, starting and ending in the same state, is 
t/fii* 31 _ jjxrnuai _ q g ut f or a ferromagnet the total work done in a cyclic process 
is not 0. In other words, the process of changing B is not a conservative process. 
There is heating of the system as B changes. We cannot define the magnetic en¬ 
ergy merely by integrating the work done on the system. 



FIGURE 10.17 Example 13. A toroidal solenoid filled by a ferromagnetic material. 
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EXAMPLE 13 Consider an iron ring wound with N turns of wire carrying cur¬ 
rent I. For simplicity assume the radius r of the ring is much larger than the 
dimensions of its cross-sectional area A. How much energy must be supplied to 
carry the iron through one hysteresis cycle? 

By Ampere’s Law, H inside the torus, at distance r from the symmetry axis, is 


i - 

H = — 0. (10.65) 

2 nr 

We may treat rasa constant. Now let I vary to change H and B. The power that 
must be supplied by the power supply is P — —IS, where £ is the back EMF, 
which is —NAdB/dt by Faraday’s Law. So, 


P = 


2nrH 
N 


dB dB 

NA — = VH —, 
dt dt 


( 10 . 66 ) 


where V — 2 nr A is the volume of iron. The total energy supplied as H and B 
vary around one hysteresis cycle is 


W 


= I Pd ' = v f 


HdB. 


(10.67) 


In other words, <fi HdB is the energy loss in the iron, per unit volume perhystere- 
sis cycle. 

If H is plotted as a function of B, as illustrated previously in Fig. 9.19, then 
the area enclosed by the hysteresis loop in the BH plane is <f HdB. The energy 
loss per unit volume equals the area enclosed by the hysteresis loop. 


EXERCISES 

Sec. 10.1. Motional EMF 

10.1. A metal bar (mass m, length i) slides on horizontal conducting rails in a vertical 
uniform magnetic field, as shown in Fig. 10.18. The electrical resistance of the 
closed circuit consisting of the rails and the bar is R, independent of the position x 
of the bar. 


uniform B 



FIGURE 10.18 Exercise 1. A conducting bar of mass m slides on conducting rails. 
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(a) If the bar moves with velocity i> in the x direction, what is the current in the 
circuit? 

(b) What is the force on the bar? Note that this is a braking force. 

(c) If the initial velocity of the bar is no, how far does the bar slide before stopping? 
(Neglect friction with the rails.) 

(d) Show that the initial kinetic energy of the bar has all been expended as Joule 
heat when the bar stops. 

10.2. In 1996 astronauts on the space shuttle Columbia did an interesting experiment, 
to test an idea for generating electric power in satellites in orbit around the Earth, 
using a conducting tether. Columbia deployed a satellite attached to the space shut¬ 
tle by a 20-kin long tether. Motion of the tether through the Earth’s magnetic field 
would generate a voltage drop across the tether. (In vacuum, charge would build up 
on the ends, but in the ionosphere current would flow, the return current being in 
the plasma.) 

Assume the tether is a straight wire, held taut by the shuttle and satellite at 
its ends. The whole system revolves around the Earth with speed 8 km/s. Assume 
for simplicity that the wire direction i, velocity v, and Earth’s magnetic field B 
are all mutually orthogonal. The strength of the magnetic field is 0.3 x 10 -4 T at 
the height of the satellite. Calculate the voltage difference between the ends of the 
wire. (In the NASA experiment the tether broke or melted through before the end 
of the experiment. But data collected before this catastrophe showed the potential 
for generating power by this method.) 

10.3. A copper disk of radius 5 cm rotates at 20 revolutions per second, in a magnetic field 
B = 0.5 T n perpendicular to the disk. The rim and center are connected electrically 
by a fixed wire with sliding contacts. The total resistance is 10 £2. Calculate the 
induced current. 

10.4. A magnetohydrodynamic (MHD) generator is a device that has been proposed for 
generating power from flow of ionized plasma, e.g., in nuclear fusion reactors. The 
plasma flows in the z direction through a rectangular pipe, whose cross section is 
parallel to the xy plane, and there is a magnetic field B = B i in the plasma. The 
x, y, z dimensions of the pipe are i v, h, i, respectively. The walls at x = ±w/2 are 
insulating, and the walls at y — ±h/2 are conducting. 

(a) Show that the potential between the conducting walls is V = vBh , where v is 
the fluid velocity. 

(b) Suppose the conducting walls are connected by a wire with resistance R. De¬ 
termine the current in the wire, if p is the resistivity of the plasma. (Hint: There 
are currents in series in the wire and in the plasma.) 

10.5. A method for measuring the Earth’s magnetic field is to use a flip coil and ballis¬ 
tic galvanomenter. The coil, of radius a, turns quickly through 180 degrees. The 
galvanometer measures the total charge that flows through the coil as it flips. 

(a) Explain how the vector B can be measured. 

(b) Suppose the coil axis is initially parallel to a magnetic field of strength 0.5 x 
10 -4 T, and a — 5 cm and the resistance of the coil is R = 0.1 £2. What total 
charge flows through the coil as it flips over? 
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Sec. 10.2. Faraday’s Law 

10.6. There is a time-dependent current l s (t) in a long and densely wound solenoid. 

(a) Determine the electric field at radius r on the midplane of the solenoid, both 
inside and outside the solenoid. (Hint: The direction of E is azimuthal; use 
the Amperian loop trick.) This problem is another example of the quasistatic 
approximation. Use magnetostatics to derive a formula for B due to a steady 
current, and use the same formula for the varying current. 

(b) From your result of (a) calculate the curl of E at radius r. 

10.7. How can the purely azimuthal electric field E = E^cj) = —(Eor/a)4> be produced 
in a cylindrical region of radius a? 

10.8. A long straight wire carries an alternating current I(t) = Iq cos ait. Nearby is a 
square loop. The wire lies in the plane of the loop, parallel to two sides of the 
square, which are at distances a and b from the wire. (The side of the square is 
b — a.) Determine the current induced in the square loop if its resistance is R. 

10.9. A circular loop of wire with radius a and electrical resistance R lies inthexy plane. 
A uniform magnetic field is turned on at time t = 0; for t > 0 the field is 

(a) Determine the current / ( t ) induced in the loop. 

(b) Sketch a graph of I ( t ) versus t. 

10.10. A metal disk of radius a, thickness d , and conductivity a is located in the xy 
plane, centered at the origin. There is a time-dependent uniform magnetic field 
B(r) = B(t) k. Determine the induced current density J(x, t) in the disk. 

10.11. A classroom demonstration of eddy currents and magnetic braking is to let a small 
cylindrical magnet, e.g., a cow magnet, fall through a vertical copper tube with a 
slightly larger diameter than the magnet. Analyze the induced eddy currents, and 
the force on the magnet, and explain why the magnet falls at a slow terminal veloc¬ 
ity. 

10.12. Consider the circuit shown in Fig. 10.19, in the center of which there is a long 
solenoid with a changing magnetic field B (t). Two voltmeters are connected as 
shown across the two resistors R\ and /?2- 


A 



FIGURE 10.19 Exercise 12. The two voltmeters V\ and Vj have different readings. 





Exercises 


391 


(a) Determine the readings on the two voltmeters. 

(b) How is it possible that the readings are different? The readings are different 
even if R \ = Ri- Should not there be a unique voltage drop between points A 
and B? (Reference: R. H. Romer, Am. J. Phys. 50, 1089 (1982).) 

Sec. 10.3. Applications 

10.13. The alternator in a car consists of a rectangular coil, with 250 turns of wire and area 
0.01 m 2 , rotating in a 0.1 T magnetic field. (The field is produced by a direct-current 
electromagnet.) If the rotation rate is 10 3 rpm, what is the peak output voltage? 

10.14. Consider a betatron with these parameters: the radius of the electron orbit is 0.5 m; 
the kinetic energy of an electron injected into the accelerator is 2.0 MeV; and the 
rate of increase of the magnetic flux through the area of the toroidal vacuum cham¬ 
ber is 25 Wb/s. The electrons are ejected after 4 ms of acceleration. 

(a) Compute the magnitude of the induced electric field. 

(b) Compute the work done on an electron per revolution around the orbit. 

(c) Compute the number of revolutions completed by an electron before ejection. 
(Approximate the electron speed by c.) 

(d) Compute the final kinetic energy of the electron. 

(e) In order to keep the radius r of the electron path constant, 8 at r must be one 
half of the average of B over the area enclosed by the circular orbit. What is 
dB /dt at r during the acceleration? 

10.15. Show that a series LC circuit without resistance is a harmonic oscillator. Calculate 
the frequency if L = 300 mH and C = 1 /iF. 

10.16. Consider a coaxial cable consisting of two long concentric hollow conducting 
cylinders with radii a and b. A current I travels up the inner cylinder, and down 
the outer cylinder. Determine the self-inductance per unit length, both from the 
definition L = <J>/ /, and from the magnetic energy j Z./ 2 . 

10.17. The circuit shown in Fig. 10.20 consists of a switch, inductor, light bulb, and bat¬ 
tery. Assume R = 10 £2 and L = 10 mH. The switch closes at time t = 0. At what 
time does the light reach 90% of its final brightness? (To answer this question one 
must make some assumption about the thermal conductivity of the bulb filament. 
Assume it is infinite, i.e., no time delay between Joule power and light intensity.) 



FIGURE 10.20 Exercise 17. 
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10.18. Determine the self-inductance per unit length of circumference, L' = L/2nR , for 
a toroidal solenoid —a wire wound around a torus. Show that in the limit R -> oo 
with A and n fixed, where R is the toroid radius, A the cross-sectional area, and 
n the number of turns per unit length, the result approaches the self-inductance 
per unit length of a cylindrical solenoid. (The self-inductance of the latter case is 
(10.25).) 

Sec. 10.4. Mutual Inductance 

10.19. Find the mutual inductance M between a long cylindrical solenoid, tightly wound 
with ni turns per unit length, and a short solenoid inside it and far from the ends. 
The axis of the short solenoid is parallel to the axis of the long solenoid, and the 
short solenoid has cross-sectional area A and N s turns. Notice that the mutual in¬ 
ductance is the same for any uniform cross-sectional shapes of the solenoids. Note 
also that M is the same whether the N s turns of the inner solenoid are wound tightly 
or loosely. 

10.20. Figure 10.21 shows an equilateral triangular loop whose altitude is a, and a long 
wire in the same plane. The wire is parallel to and at distance b from the base of the 
triangle. What is the mutual inductance? Assume the return path of the long wire 
is far away. [Answer: 



10.21. The ignition coil in a car supplies 20 kV to the spark plugs. Suppose the maxi¬ 
mum current in the primary coil is 4.0 A, and the current is interrupted 100 times 
per second (by the distributor points). Estimate the mutual inductance of the pri¬ 
mary and secondary coils. (Hint: You will have to make a reasonable model of the 
function /(?)■) 

Sec. 10.5. Magnetic Field Energy 

10.22. Equal but opposite currents +/ and —I flow in two long parallel plates, as shown 
in Fig. 10.22. The plates have width w and separation d, where d is small. 

(a) Neglecting edge effects, find the magnetic field between the plates, from 
Ampere’s Law. 
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FIGURE 10.22 Exercise 22. End view of long parallel plates of width w and separation 
d, carrying total currents +/ and —/. 


(b) Calculate the magnetic field energy per unit length. 

(c) Use the result of (b) to show that the self-inductance per unit length is pt^d/w. 

10.23. Determine E/B if the electric and magnetic field energy densities are equal. Eval¬ 
uate the ratio in m/s. What is this constant? 

10.24. Consider the circuit shown in Fig. 10.23. Assume the switch has been in the posi¬ 
tion shown long enough that the steady state has been reached. Now at t = 0, flip 
the switch. 



(a) Determine the current / ( t ) for t > 0. 

(b) Determine the total energy dissipated in the resistor after t = 0. 

(c) Show that the result of (b) is equal to the energy stored in the inductor at t = 0. 

10.25. Show that the magnetic field energy of two current loops is 

\L\I\+ {l 2 I%+MI\I 2 

where L\ , Z. 2 , and M are the self and mutual inductances. By convention we take 
M to be positive, and the signs of I\ and Ij determine the directions of the currents. 
(Reference: The Feynman Lectures on Physics, Vol. II, Sec. 17-8.) 

10.26. Consider a collection of current loops C i, C 2 , C 3 , . . . fixed in space. 

(a) Show that the total energy that must be supplied to establish currents I\, I 2 , 
/ 3 ,... is 

u = Y,\ L i I i+E M ij I i I j- 

i i<j 
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(Hint: The power is — £ where £,■ is the EMF induced by the changing 
flux through C ( -.) 

(b) Show that U = jJ2i A' • 

10.27. Following are two exercises on astrophysical magnetic field energies. 

(a) The average magnetic field in the interstellar space of our galaxy is about 3 x 
10' 1(1 T. If the galaxy is taken to be a disk-shaped volume, of diameter 10 21 m 
and thickness 10 19 m, what is the total magnetic energy of the galaxy? The 
total power radiated from all stars in the galaxy is about 10 37 W. How many 
years of starlight are equivalent to the stored magnetic energy? 

(b) The magnetic field at the surface of a neutron star, or pulsar, is about 10 8 T. 
What is the magnetic energy density for this field? Using the mass-energy re¬ 
lation E = me 2 , find the mass density that corresponds to the magnetic energy 
density. Compare this to the mass density of the neutron star, assuming it has 
the mass of the sun in a sphere of radius 10 km. 

10.28. Assume the magnetic field of the Earth, in the region outside the Earth, is a pure 
dipole field corresponding to dipole moment m = 8 x 10 22 J/T. (This is not com¬ 
pletely accurate, because the solar wind cuts off the field at a distance of order 10 
Earth radii, but it’s a good enough approximation for this exercise.) Calculate the 
total energy of the magnetic field outside the Earth. Compare the result to the rota¬ 
tional kinetic energy of the Earth. 

General Exercises 

10.29. The solenoid in Fig. 10.24 is long, and has n turns per unit length and cross sec¬ 
tion A. 



(a) If the current in the solenoid changes from I\ to I 2 , how much charge passes 
through the resistor? 
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(b) If the current in the solenoid, as a function of time t, is 

Z(0 = /ie _,/T +/ 2 (] 

what is the current IrC) in the resistor? 

(c) Use a computer to plot 7 ( t ) and Ir (t), as functions of t/z, for I\ = 1 A and 
12 = 2 A. Describe in words what is happening. 

10.30. In Fig. 10.18 the conducting rod m is in contact with metal rails and the whole sys¬ 
tem is immersed in a uniform magnetic field of strength B = 0.5 T in the direction 
perpendicular to circuit. (The bar. rails and cross piece at the end of the rails form 
a continuous conducting path.) 

(a) If the rod moves toward the right with a speed of 4.0 m/s, and the separation of 
the rails is l = 0.10 m, fi nd the magnitude and direction of the induced emf. 

(b) If the circuit has a resistance of 0.20 S2 when the rod is at a certain position, 
find the force exerted on the rod by the magnetic field. 

(c) Compare the power of the external force and the power lost in Joule heating. 
Ignore friction. 

10.31. Calculate the terminal velocity of a rectangular loop falling out of a region of B, as 
shown in Fig. 10.7. Take B = IT, R = 0.1 S2, M = 0.1 kg, and w = 0.1 m. 

10.32. For the Rowland Ring of Fig. 9.18 (Chapter 9): 

(a) Show that if the primary current is changed by A 7 p then the incremental change 
in H in the magnetic material is 

AH = Np A/ p /(2jr/?o). 

(b) Show that the resultant change in B in the magnetic material satisfies 

A B 2tt RftRgAQg 
M = ANsNpAIp ’ 

where Rg is the resistance of the secondary circuit including the galvanome¬ 
ter and AQg is the amount of charge that passes through the galvanometer. 
(Assume that the cross-sectional dimension of the magnetic material is small 
compared to Rq.) This relation is used to construct magnetization curves of B 
versus H such as those in Fig. 9.19 of Sec. 9.6. 


Computer Exercises 

10.33. Consider damped oscillations in a series LRC circuit. The charge on the capacitor 
Q(t), and the current in the circuit 7(f) = dQ/dt, are functions of time t. 

(a) Show that the differential equation satisfied by Q(t) is 


L 


d 2 Q 

df2 


+ R 


d_Q 

dt 



(b) Use a computer program to solve the equation numerically. Asa template, here 
are Mathematica commands to solve the equation for parameter values L = 1, 
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R = 1, and C = 1 (in some system of units) and with initial values Q( 0) = 4 
and 7 (0) = 0: 

{L,R,C>={1,1,1> 
eqs={Q’ [t]==I [t] , 

I’[t] ==- (R/L) I [t] -Q [t] / (L*C), 

Q[0]==4,I[0]==0> 

soln=NDSolve [eqs, {Q [t] , I [t] }, {t, 0,20}] 

Qsol [t_] : =Evaluate [Q [t] /. soln] 

Isol[t_]:=Evaluate[I[t] /. soln] 

After these commands have been executed, Qsol[t] and Isol[t] are the 
functions Q(t) and 7(f). 

(c) Make plots of Q(t ) and I{t) for overdamped oscillations (R 2 > 4 L/C), 
underdamped oscillations (R 2 < 4L/C), and weakly damped oscillations 
(R 2 «4 L/C). 

10.34. The differential equation for a harmonically driven series LRC circuit is 


.d l Q dQ Q f 

L ^T + R 7r + c- e °““'- 


(a) Let L = 1, C = 1, and R = 0.1, in some system of units. Solve the equation 
numerically, for initial values Q( 0) = 0 and 7 (0) = 0, and driving frequency 
co = 0.5. After a transient period, the solution settles down to a steady state 
oscillation at the driving frequency co. 

(b) Now vary co , in the range from 0.5 to 1.5. Show, by plots of Q(t) and 7(r), that 
the steady state oscillations have maximum amplitude if co & 1 / -J LC. Explain 
this result in terms of resonance. 

(c) Show graphically that there is a phase shift between the applied EMF and the 
charge on C. 



CHAPTER 



The Maxwell Equations 


Before the development of field theory by Faraday and Maxwell, electric and 
magnetic phenomena were ascribed to “action at a distance” between charges and 
between currents. Action at a distance refers to the experimental observation that 
charges or currents exert forces on each other even though they are not in con¬ 
tact. The Coulomb force between static charges, and the force between currents 
explored by Ampere, both appear to be examples of action at a distance. 

Faraday, from his extensive experience with experiments on electricity and 
magnetism, developed the idea that there must be something physical in the space 
around charges and currents, which he called lines of force. This idea was the 
kernel of field theory. 

Maxwell expressed Faraday’s idea in four partial differential equations—local 
relations—of fi elds and sources. Maxwell coined the phrase electromagnetic field 
as the name for his theory, because the action takes place not at a distance, but 
locally in the space around the sources. Maxwell’s equations were published in 
1864, and we still use the same equations today. This theory has really stood the 
test of time! 

The interaction between charges in field theory is quite different from action 
at a distance. A charge creates an associated electromagnetic field throughout 
the surrounding space; another charge experiences a force exerted by the local 
field at its position. The fields have energy and momentum. They are not mere 
mathematical constructions, but have just as much physical reality as the charges. 
From this viewpoint, the answer to the question “What is the Universe made of?” 
is “particles and fields.” 

Maxwell’s mathematical theory revealed that light is an electromagnetic phe¬ 
nomenon. Also, it predicted the existence of other forms of electromagnetic ra¬ 
diation, unknown at the time, such as radio waves and X rays. The theory led to 
great advances in technology. Recent applications of the theory are cellular tele¬ 
phones, positional devices for global navigation, and communication with space¬ 
craft. 

The purpose of this chapter is to introduce the Maxwell equations, both in 
vacuum and in the presence of matter, and to examine some of their consequences. 
In particular, we will begin the study of electromagnetic waves in Sec. 11.5. 
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11.1 ■ THE MAXWELL EQUATIONS IN VACUUM AND 
THE DISPLACEMENT CURRENT 

The Maxwell equations express relations between the electromagnetic fields 
E(x, t) and B(x, t), and between the fields and their sources. We start our study 
by presenting the Maxwell equations in vacuum. Then in Sec. 11.3 we will con¬ 
sider how the equations should be rewritten in the presence of dielectric and 
magnetic materials. 

In considering Maxwell’s equations in vacuum, we don’t necessarily mean in 
empty space. We have in mind a space in which there are sources, generally vary¬ 
ing in time, of charge p(x, t) and of current J(x, t). We may think of these as 
ideal sources, i.e., not associated with matter. In reality currents flow in wires, 
and charge resides in electrons or ions; real physical sources are associated with 
matter. But for many practical problems—and the Maxwell equations in vacuum 
have wide-ranging practical applications—the vacuum equations, with idealized 
sources, are an accurate description of electromagnetic phenomena. In such cases 
the material properties of the physical sources can be neglected. On the other 
hand, if materials are present with significant magnetic or electric susceptibili¬ 
ties, then bound charge and current must be taken into account, as explained in 
Sec. 11.3. 

We have already studied three of the Maxwell equations, 

V • E = p/e 0 (11.1) 

V • B = 0 (11.2) 

V x E = —dB/dt. (11.3) 

Equations (11.1) to (11.3) are correct and complete as they stand. They hold for 
time-dependent systems as well as for static systems. The first (11.1) is Gauss’s 
Law for the electric field E and charge density p, which states that the electric field 
diverges from charges. The second (11.2) is Gauss’s Law for the magnetic field 
B, which states that there are no magnetic monopoles, i.e., no isolated pointlike 
magnetic sources. Equivalently, lines of B are closed loops. The third equation 
(11.3) is Laraday’s Law of electromagnetic induction, which states that E curls 
around the rate of change of B if B varies in time. 

Another necessary equation in electrodynamics is the continuity equation, dis¬ 
cussed in Sec. 7.2, which expresses the local conservation of charge: 

V-J=-^. (11.4) 

dr 

In words, electric charge is conserved throughout space: the current flux away 
from a point is equal to the rate of decrease of the charge at that point. 

In addition to the above equations, we have previously studied Ampere’s Law 
for magnetostatics: 


V x B = /u.qJ (magnetostatics). 


(11.5) 



11.1 The Maxwell Equations in Vacuum and the Displacement Current 


399 


Equation (11.5) states that electric current is a source of magnetic field, and B 
curls around a constant current. Although (11.5) is true for time-independent cur¬ 
rents, it is not complete and it must be modified for time-dependent problems. 
The modification consists of an additional term, called the displacement current, 
added to the right-hand side, which we will derive presently. Remarkably, it is pos¬ 
sible to construct the complete equation by a purely theoretical analysis, which is 
how it was found originally by Maxwell. 

Maxwell was the first theorist to construct differential equations such as (11.1) 
to (11.5) for electromagnetism. But he found that (11.1) to (11.5) are not math¬ 
ematically self-consistent. Specifically, there is an inconsistency between (11.5) 
and (11.4) if the charge system is not static. To understand this point, let’s first 
examine the divergence of (11.3), which will prove to be consistent, and then the 
divergence of (11.5), which will prove to be inconsistent for nonstatic electric 
fields. In the case of Faraday’s Law, the divergence of the left-hand side of (11.3) 
is 0, because the divergence of the curl of any vector is 0; and the divergence of 
the right-hand side is also 0, by (11.2) 

v (f) = l <VB, = a <1L6) 

(It is important to understand that V and 3/3 1 commute.) Thus equations (11.3) 
and (11.2) are consistent. But now consider the case of Ampere’s Law from mag¬ 
netostatics (11.5): Again the divergence of the left-hand side is 0. But the diver¬ 
gence of the right-hand side /i-o V • J is in general not 0, by (11.4). V • J is 0 if 
the charge density is static, but in general V • J is —dp/dt. Equations (11.5) and 

(11.4) are thus inconsistent for time-dependent systems. 

11.1.1 ■ The Displacement Current 

Maxwell made the equations of electromagnetic fields consistent by modifying 

(11.5) , for the general time-dependent case, adding another source term that he 
called the displacement current. The complete equation, in vacuum, is the fourth 
Maxwell equation 

3E 

V x B = /uoJ + do^o — ■ (1L7) 

Ot 

We may call this equation the Ampere-Maxwell Law. Itis consistent with the oth¬ 
ers. The divergence of the left-hand side of (11.7) is 0 as before. The divergence 
of the right-hand side is also 0: The divergence of J is —dp/dt by conservation 
of charge, and the divergence of codE/dt is +dp/dt by Gauss’s Law. The four 
Maxwell equations, (11.1) to (11.3) and (11.7), along with the continuity equa¬ 
tion (11.4), are a self-consistent theory of the electromagnetic field. These are the 
field equations in vacuum. They are tabulated in the second column of Table 11.1. 

These equations speak to us if we understand the language of vector calculus. 
Faraday’s Law tells us that a changing magnetic field acts as a source of electric 
field, as E curls around the change of B. The Ampere-Maxwell Law says that 



400 


Chapter 11 The Maxwell Equations 


TABLE 11.1 The Maxwell equations 


Equation 

In vacuum 

In matter 

Gauss 

V • E = p/e 0 

V • D = p f 

Gauss 

V • B = 0 

V • B = 0 

Faraday 

V xE = — 3B/3f 

V x E = — 3B/3t 

Ampere-Maxwell 

V x B = poJ + P-oeodE/dt 

VxH=J f + 3D/3t 


a changing electric field acts as a source of magnetic field, similarly. Maxwell 
named this source of B the displacement current. The displacement current den¬ 
sity in vacuum is defined as 


3E 3D 
Jd = 60 37 = ~dt' 


( 11 . 8 ) 


where D = 6 qE is the displacement field in vacuum. That is, the Ampere-Maxwell 
Law is 


VxB = go(J + Jo). (11.9) 

Both charge current J(x, t) and displacement current J d (x, t) create the magnetic 
field. In fact, it is possible to have a magnetic field without any charge current at 
all, created only by an electric field that varies in time. We’ll see later that this 
phenomenon occurs in electromagnetic waves. By integrating (11.9) over an open 
surface S with boundary curve C, and applying Stokes’s theorem, we obtain the 
Ampere-Maxwell Law in integral form 

<j) B-d£ = p 0 J^J + J D )-dA. ( 11 . 10 ) 

Written more simply, 

^B dt = pQU + I D ) (11.11) 

where I is the charge current and Io the displacement current through S. 

In laboratory systems of charges and currents, the displacement current is nor¬ 
mally very small compared to typical charge currents, which explains why the 
displacement current was not discovered experimentally before Maxwell’s the¬ 
ory. To observe J/j experimentally requires high frequencies, large |E|, or both. 
Nevertheless, the displacement current is an essential part of electromagnetism 
because it is necessary to the propagation of electromagnetic waves, as we shall 
see. Indeed, the first experimental confirmation of the displacement current came 
with Hertz’s laboratory demonstration of electromagnetic waves in 1887, more 
than 20 years after the publication of Maxwell’s theory. A later direct labora¬ 
tory observation of the displacement current, i.e., not in electromagnetic waves 
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but in a system of charges and currents, was an ingenious experiment by M. R. 
Van Cauwenberghe in 1929. 1 

The Maxwell equations have wave solutions (Sec. 11.5) that describe light and 
other forms of electromagnetic radiation. The first production of electromagnetic 
waves in a laboratory experiment, which verified the Maxwell theory, was by 
Heinrich R. Hertz (1857-1894) in 1887. He discharged an induction coil through 
a spark gap, and observed small sparks in the spark gap of a receiver coil across 
the room. After studying this small effect in detail, he explained it as a conse¬ 
quence of electromagnetic waves traveling from the induction coil to the receiver. 
Hertz made this discovery as a result of basic scientific research, whose purpose 
was to clarify the role of electrical oscillations in Maxwell’s theory. But Hertz’s 
research had an immensely important practical application because it led to the 
development of radio communication. The waves he produced and whose prop¬ 
erties he studied had wavelengths around 10 m; i.e., frequency 30 MHz, which 
falls between the AM and FM radio frequency bands and is now used for amateur 
hf-radio . 2 

The Maxwell equations are a unified field theory of electricity and magnetism. 
Furthermore, optics is a part of the theory, because light is an electromagnetic 
phenomenon described by Maxwell’s equations. 


EXAMPLE 1 Charging a capacitor, Part 1 . What are the fields E(x, t) and 
B(x, t) in and around a capacitor as it is being charged? 

Figure 11.1 shows a parallel plate capacitor being charged by a current I that 
flows in wires along the z axis. The capacitor plates are disks with radius a. We 
will use the Ampere-Maxwell Law (11.7) to find B on the three circular paths Ci, 
C 2 , and C 3 shown in the figure. 

The charges on the left and rightplates are, respectively, Q(t) and —Q(t)\ so 
the electric field between the plates, i.e., for r < a, is 


E(r) = 


rr(f) r 
—-k 
eo 


Qit) 
eon a 2 


( 11 . 12 ) 


by Gauss’s Law, which holds for charge distributions that depend on time as well 
as for static charge distributions. Because both current densities J and Jd in (11.9) 
are in the z direction, so that V x B is in the z direction, the magnetic field is az¬ 
imuthal, of the general form B = B$(r, z)<p in each region. There are minor 


1 See Exercise 1. 

2 Richard P. Feynman (1918-1990) liked to use this history to emphasize how basic research can lead 
to the most important of applications, even more effectively than applied research. To paraphrase how 
Feynman put it: If Hertz had been hired by a company to find a way to communicate by voice over 
long distances, say between New York and California, then he would have experimented with “long 
tubes” instead of with inductors, oscillators, and sparks. Feynman in his lecture would demonstrate 
how Hertz might shout into the tube: "Hello! It’s me, Hertz, in New York! Can you hear me yet in 
Pittsburgh?” 
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FIGURE 11.1 Charging a capacitor. The parallel plate capacitor is being charged by 
the current / = dQ/dt along the z axis. The plates are disks of radius a separated by 
distance d with d < 3 C a. (The separation is exaggerated in the figure.) C\, C 2 , and C 3 are 
circular Amperian loops, on which B can be determined by (11.10). 


corrections to these ideal fields, called fringing fields. The fringing fields are neg¬ 
ligible between the plates if the plates are close together. They are only important 
near the edges. 

Region 1: What is B in the regions to the left and right of the capacitor? 

We integrate (11.7) over the disk Si bounded by Ci, with radius r. By Stokes’s 
theorem the flux of V x B through Si is equal to the circulation of B around Ci, 
so the field equation implies 

® B • di = no f J • rnM + Moeo 37 - f E n dA. (11.13) 

JC\ JSi °t J S] 

Here the normal vector n is k, and the direction of di along Ci is defined by 
the right-hand rule. The charge current through Si is I so that the first term on the 
right side of (11.13) is /x 0 /. The displacement current through Si, expressed in the 
second term on the right, is zero in the quasistatic approximation . 3 The integral 
of E n is not strictly zero because there is an E in the wire and an E\\ near it, but 
unless the frequency is very high the displacement current is negligible compared 

3 The quasistatic approximation is the somewhat subtle issue that we have mentioned in previous chap¬ 
ters. In this approximation we use formulas derived from electrostatics or magnetostatics, to describe 
fields in systems that are changing in time. If the rate of change is slow, the approximation is accurate. 
In fact, the quasistatic approximation is quite accurate except in extreme cases. We will discuss the 
quasi static approximation in more detail in Example 3. 
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to I. Therefore (11.13) becomes 


B^lnr — im)I, 


(11-14) 


where the azimuthal symmetry has been used to evaluate the line integral; or, 


B 


noi 


0 2nr 


(11.15) 


This calculation and its result are the same as for an infinite straight wire with cur¬ 
rent I. Here I may vary in time. Had we not made the quasistatic approximation 
there would be a small additional contribution to B$ from displacement current. 


Region 2: What is B along C 2 , i.e., between the plates in z, but outside the 
cylinder (r > a)l 

In this case we integrate (11.7) over the disk S 2 bounded by C 2 , and ob¬ 
tain again an equation like (11.13), but now with integrals over C 2 and .S’ 2 . The 
charge current through S 2 is 0. The displacement current, using the electric field 
in ( 11 . 12 ), is 


I D (S 2 ) — e o~ f E z dA= eo^-4na 2 = / (11.16) 

dt Js 2 e 0 7Ta l 

because dQ/dt — I by conservation of charge. Therefore the magnetic field 
in Region 2 is again given by (11.15), the same as that of a straight wire. In 
calculating the magnetic field in this region we have, as a good approximation, 
ignored the fringing field of E. 

Region 3: What is B along C 3 , i.e., between the plates in z, and inside the cylin¬ 
der (r < a)? 

Again we integrate (11.7), this time over the disk S 3 bounded by C 3 , and ob¬ 
tain an equation like (11.13), but now with integrals over C 3 and S 3 . The charge 
current through S 3 is 0. The displacement current is 

3 f dQ/dt 7 7r 2 

Id(S^) = € 0 — E z dA — e 0 - jJt (11-17) 

at Js 3 eoa z 


Thus, by Stokes’s theorem, 


B*{r) = 


j-tplr 

2na 2 


(11.18) 


The magnetic field in Region 3 is the same as if the capacitor were replaced by 
a conducting cylinder, of radius a, carrying total current I uniformly distributed 
across its cross section. As a source of B the displacement current is equivalent to 
a charge current. 
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B(x, t) is continuous at the cylindrical surface r — a because there is no sur¬ 
face current. However, B is discontinuous across either plate. The discontinuity 
of B is related to the radial surface current that flows in the plate as the capac¬ 
itor is being charged, in accord with the general boundary condition at a current 
sheet. 4 


Example 1 can be used to understand in another way why the displacement 
current is logically necessary for consistency of the field equations. Figure 11.2 
shows an Amperian loop C, in the form of a circle around the wire to the left of 
both capacitor plates. According to Stokes’s theorem, the circulation of B around 
C is equal to the flux of /u,o(J + Jd) through any surface S bounded by C; this 
is (11.10) expressed in words. Therefore this flux must be the same for all such 
surfaces. First consider for 5 the disk S a bounded by C. The flux through S a 
is nol, where I is the current in the wire, because J passes through S a but 3d 
is 0 on S a . Next consider instead the bag-shaped surface Sb, whose boundary is 
also C. The flux through S/, must also be no I for mathematical consistency. But 
there is no charge current through Sb, so without the displacement current there 
would be an inconsistency. There is displacement current through Sb given by 
Id = d(DA)/dt where A — it a 2 is the area of the plates. Now, by Gauss’s law 
DA is equal to Q, the charge on the positive plate, so Id = dQ/dt — I. In other 
words, the displacement current through Sb equals the charge current through S a , 
both being I. The theory is self-consistent. 

The displacement current is not a flow of charge, but a variation of D. The 
above discussion is based on the idea that both charge current and displacement 



FIGURE 11.2 Why the displacement current is necessary. The capacitor is being 
charged by a current I. The circle C is the boundary curve of both the disk S a and the 
bag-shaped surface Sb- By Stokes’s theorem, the displacement current through Sb must be 
equal to the charge current through S a . 

4 See Exercise 3. 
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current produce B. While this view is self-consistent, there are some subtle ques¬ 
tions about “cause” and “effect.” In a region where J = 0, (11.7) describes a 
correlation between the spatial variation of B and the temporal variation of E 
(or D). But who is to say which is the cause and which the effect? For physical 
intuition it is helpful to think of 3D/3 t as a kind of current density, but a more 
precise statement is that 3D/3 1 contributes to the curl of B in the same way as J. 5 


11.2 ■ SCALAR AND VECTOR POTENTIALS 

We now tum to an interesting and important general problem. How can we find 
the time-dependent fields E(x, t ) and B(x, t ) produced by time-dependent charge 
and current sources in vacuum? Much of modem communication technology, e.g., 
radio, television, and satellite communication, are applications of this problem. 

The Maxwell equations contain all that is needed to solve the problem. How¬ 
ever, calculating the fields directly from the sources is often difficult, and rarely 
the best approach. Rather, it is better to introduce potential functions —time- 
dependent scalar and vector potentials—whose mathematical connection to the 
sources is easier to handle. Once these potentials are known, the fields can be 
calculated by differentiation. 

This situation is similar to what we found in electrostatics and magnetostatics. 
Recall the advantages of finding the potentials, namely V(x) for electrostatics 
and A(x) for magnetostatics, and then calculating E = — V V and B = V x A. 
First, the equations connecting the potentials and sources are simpler than the 
equations relating fields and sources. Also, the fields E and B have altogether six 
components, but the potentials V and A have only four. The same advantages 
exist for time-dependent problems. 

It is straightforward to generalize the potentials to the case of time-dependent 
fields. Any vector function whose divergence is 0 can be written as the curl of 
another vector function. Therefore (11.2) implies that we can always find a vector 
potential A(x, t) such that 


B = V x A. 


(11-19) 


Substituting this form of B into (11.3) gives 


V x 



( 11 . 20 ) 


where we have used the fact that V and 3/3 1 commute. Now, any vector function 
whose curl is 0 can be written as the gradient of a scalar function. That is, (11.20) 
implies that we can always find a scalar potential V (x, t) such that 


„ 3A 
E + —— = 
3 1 


-VV, 


5 A. P. French, The Physics Teacher , 38, 274 (2000). 


( 11 - 21 ) 



406 


Chapter 11 The Maxwell Equations 


or, 


E = —VV-. (11.22) 

3 1 

(Note that the familiar electrostatics equation E = —VV is not true for time- 
dependent fields.) 

Equations (11.19) and (11.22) are a reflection of the Helmholtz theorem. 6 The 
theorem states that any vector field can be separated into a solenoidal part, which 
can be written as a curl, and an irrotational part, which can be written as a gradient. 
In (11.19) the purely solenoidal field B is written as the curl qf A. In (11.22) the 
purely irrotational function E + 3A/3 1 is written as — V V. 

With the fields B(x, t ) and E(x, t ) expressed in terms of the potentials A(x, t ) 
and V(x, t ) by (11.19) and (11.22), two of the Maxwell equations are automat¬ 
ically satisfied, namely (11.2) and (11.3). To proceed we must rewrite the other 
two Maxwell equations in terms of A and V, obtaining the equations that will 
determine the potentials for whatever sources are present. But before deriving 
equations for A and V , we address the question of uniqueness. 

11.2.1 ■ Gauge Transformations and Gauge Invariance 

Are the potentials A(x, t ) and V (x, t) uniquely determined by the charge and 
current sources in a system? The answer is that these functions are not uniquely 
determined, a fact that creates some richness in the theory. 

Of course the electric and magnetic fields are uniquely determined if the 
sources p(x, t ) and J(x, t ) and boundary conditions of the system are established. 
After all, the fields are physical quantities, that could be measured by suitable 
experimental apparatus under the given conditions. In contrast, the potentials are 
introduced as a mathematical convenience; at least that is true in classical electro¬ 
magnetism. 7 There are no classical instruments for measuring V (x, t ) or A(x, t), 
and the potentials do not have direct physical significance. They are not even 
unique for a given electromagnetic system. 

If A(x, t) and V (x, t ) describe the fields of a certain system, then other poten¬ 
tial functions, which also describe the same fields, can be constructed by transfor¬ 
mations called gauge transformations. Let f(\,t) be an arbitrary scalar function. 
The gauge transformation associated with /(x, t ) is 

A —> A' = A + V/ (11.23) 

V—» V' =y ——. (11.24) 

3 1 

In words, the arrows (—>) indicate that we replace the potentials A and V by 
new potentials A' and V (the transformed potentials) that differ from A and V 

6 The Helmholtz theorem is presented in Appendix B. 

7 The Bohm-Aharanov effect in quantum electrodynamics is a physical effect that is directly sensitive 
to the vector potential A. 
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by V/ and —df/dt, respectively. The fields E and B are the same for the primed 
potentials as for the unprimed potentials. The curl of A' is equal to the curl of A 
because V x V/ = 0, so B is the same for the two potentials. Similarly, — VV' — 
3A'/3 1 is equal to — V V — 3A/3 1 because 

v (!rH <v/) = 0 ' 

so E is the same for the transformed and original potentials. Thus the transformed 
potentials, for any /(x, t), can be used as potentials for the same fields B and E. 
In other words, B and E are invariant with respect to the gauge transformation 
(11.23) and (11.24). The theory is said to be gauge invariant. 

We have previously encountered similar arbitrariness in defining potentials for 
static problems. In electrostatics, V (x) + C for any constant C gives the same 
electric field as V(x). In magnetostatics, A(x) + V/(x) for any function /(x) 
gives the same magnetic field as A(x). But the full gauge invariance with respect 
to the transformations (11.23) and (11.24) is more significant because it mixes 
A(x, t) and V(x, t). 


EXAMPLE 2 Scalar and vector potentials for a point charge at rest at the ori¬ 
gin. A natural choice of potentials for this static problem is 

V(x, t) — ——— and A(x, t) = 0. (11.25) 

4jteor 

The fields are E = qr/(4jteor 2 ) and B = 0. 

But now make the gauge transformation with /(x, t ) — qt /(Ane^r). The 
transformed potentials are 


V'(x, t) =0 


and 


A'(x, t) = 


—qtr 
Aneor 2 ' 


(11.26) 


The primed potentials give exactly the same fields as the unprimed potentials. 
However, the primed potentials look very surprising, because A' depends explic¬ 
itly on t even though the sources and fields are all static! This example is addi¬ 
tional evidence that the potentials do not have direct physical significance. The 
freedom to construct gauge transformations even permits time-dependent poten¬ 
tials for time-independent fields. 


11.2.2 ■ Gauge Choices and Equations for A(x, t) and V(x, t) 

We are almost ready to substitute the fields, written in terms of A and V, into 
the Maxwell equations (11.1) and (11.7). However, there is one more preliminary 
step. It is generally useful to eliminate the gauge ambiguity by imposing some ad¬ 
ditional condition on the potentials A(x, t) and V (x, t), rather than leaving their 
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form completely arbitrary. The additional condition is called a gauge choice. Be¬ 
cause of gauge invariance we have great freedom in the form of the potentials. We 
make use of this freedom to choose a convenient, or simple form. 

Coulomb Gauge. One example of a gauge choice is called the Coulomb gauge. 
The condition imposed in this case is 

V • A = 0. (11.27) 

This condition makes A unique, because the gauge transformation (11.23) would 
in general produce a transformed potential that does not satisfy the condition 
(11.27). In other words, jf A satisfies the condition (11.27) then A' of (11.23) 
would not satisfy the condition unless V 2 / = 0; imposing a boundary condition 
on the vector potential, e.g., at infinity, would then require that / is constant, so 
that A' is just the same as A. This means therefore that for given sources and 
boundary conditions there is only one A(x, t) that gives the correct fields and also 
satisfies (11.27). The scalar potential for the problem is also uniquely determined, 
except for an arbitrary additive constant. 

The scalar potential is particularly simple in the Coulomb gauge, because then 
Gauss’s law (11.1) reduces to — V 2 V = p/eo- This is the same equation as in 
electrostatics—Poisson’s equation—and we can solve it formally with the Green’s 
function of —V 2 , 


V(x, t ) = 


1 r p(x', t)d?x' 

4tre 0 J |x-x'| 


(11.28) 


But note that this function has an unphysical aspect: The potential V at an arbi¬ 
trary field point x and time t depends on the charge density at all source points 
at the very same time t. Equation (11.28) implies that no matter how far the field 
point is from the source, there is an instantaneous response in V (x, t ) to a change 
in p(x', t). This instantaneous response at arbitrary distance would be physically 
unacceptable if V (x, t ) were measurable. In contrast the fields, which are measur¬ 
able, do not respond instantaneously to a change in p(x', t)\ rather, there is a time 
delay—the time difference being the time for light to travel from x' to x. 8 There 
is no inconsistency in the theory because E(x, t ) is determined by a combination 
of both A(x, t ) and V (x, t), as we see in (11.22). The contribution to E from 
—3A/3 1 cancels the instantaneous changes from V. The instantaneous response 
of V (x, t) in the Coulomb gauge is just an artifact of the gauge choice. 

A disadvantage of the Coulomb gauge is that the vector potential A(x, t ) will 
in general be complicated if the sources vary in time. For this reason the Coulomb 
gauge is only useful for static systems, or to describe free electromagnetic waves. 
It is not a convenient gauge choice for problems involving the generation of radi¬ 
ation. 

8 We shall prove in Chapter 15 that the response of the fields to a change in the sources occurs at the 
retarded time. 
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Lorentz Gauge. A different gauge choice, very useful for time-dependent prob¬ 
lems like radiation, is called the Lorentz gauge 9 10 for which the gauge condition is 

dV 

V\ = -n 0 e 0 —. (11-29) 

For this gauge choice the source equations (11.1) and (11.7) take a particularly 
attractive form, because V and A separate into independent equations. First, con¬ 
sider Gauss’s law (11.1). From (11.22) V ■ E is — V 2 V — 3(V • A)/3 1. Use the 
gauge condition (11.29) to eliminate V • A for V. The result is a source equation 
involving V alone, 


—V 2 V + /x 0 eo 


d 2 V 
9 T 


p_ 

eo’ 


(11.30) 


Similarly, the Ampere-Maxwell equation reduces to an equation for A alone, 


10 


2 , 3 2 A 

-V-A + Mo^o—r = MoJ 


(11.31) 


Note that the equations for V and A are similar. The r'th Cartesian component 
A, with source /xq Ji satisfies an equation of the same form as V with source 
p/e o. We shall use the Lorentz gauge in Chapter 15 to analyze the radiation of 
electromagnetic waves by charges or currents that vary in time. There (11.30) and 
(11.31) will be solved. The results will be solidly intuitive. 

It is also possible to write the general equations for V(x, t) and A(x, t), i.e., 
without specifying any gauge condition. 11 However, the resulting equations are 
not very edifying—coupled scalar and vector partial differential equations—and 
not often used in practical calculations. The usual approach to finding V and A 
is to make a gauge choice as a first step. It should be verified at the end of the 
calculation that the solution does indeed satisfy the gauge condition. 


EXAMPLE 2 Example 2 revisited. Are the original potentials (11.25), for a 
point charge at rest at the origin, in the Coulomb or Lorentz gauge? Because 
V • A = 0 and dV/dt — 0 these potentials satisfy both (11.27) and (11.29). They 
are in bdtth the Coulomb gauge and the Lorentz gauge. 

Are the transformed potentials (11.26) in the Coulomb or Lorentz gauge? 
These potentials are in neither the Coulomb gauge nor the Lorentz gauge, be¬ 
cause V • A' = —(qt/e o)<5 3 (x) and 3 V'/dt = 0. Recall that <?<5 3 (x) is the charge 
density of a point charge q at the origin. 


9 The first use of the Lorentz gauge may have been by Ludwig Lorenz of Austria, rather than Hendrik 
Lorentz of the Netherlands. We call (11.29) the Lorentz gauge because the condition is invariant under 
the Lorentz transformations of special relativity. 

10 See Exercise 4. 

1 'See Exercise 5. 
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11.3 ■ THE MAXWELL EQUATIONS IN MATTER 

When we wrote the Maxwell equations in vacuum (see Table 11.1) we allowed 
only isolated sources p and J, but not dielectric or magnetic materials. We now 
consider the modification of the equations necessary to describe systems with 
such materials. 

Equations (11.1) to (11.3) and (11.7) are still true for systems with matter, pro¬ 
vided that we interpret p and J to include the bound charge and current—bound 
within atoms of the material. The underlying idea is that dielectric and magnetic 
materials, as indeed all matter in the Universe, consist of particles that are sources 
of, and interact with, the fields. The difficulty in solving Maxwell’s equations from 
that starting point is that in order to find p(x, t) and J(x, t ) we’d need detailed 
knowledge about the microscopic nature of the particles in atoms and molecules, 
which would be very complicated. Therefore the vacuum equations are not a prac¬ 
tical formulation of the theory in the presence of matter. Instead, we rewrite the 
equations by separating the free charge from the bound charge. We are not in¬ 
troducing new fundamental principles, but writing the equations in a form that is 
more convenient to account for the effects of the matter. 

The way that matter interacts with the electromagnetic field is determined by 
the fact that matter is atomic. Atoms and molecules respond to electromagnetic 
fields mainly as electric and magnetic dipoles. As we learned in earlier chapters, 
the origin of the electric dipoles is the separation of positive and negative charges 
within atoms, and the origin of the magnetic dipoles is the current associated 
with electron orbital motion and spin. We have learned how to take these effects 
into account for static fields. In the case of a static electric field we introduced in 
Chapter 6 the polarizationfield P(x), the electric dipole moment density in matter 
as a function of position x. In the case of a static magnetic field we introduced in 
Chapter 9 the magnetization field M(x), the magnetic dipole moment density of 
the matter. 

The same matter fields P(x, t) and M(x, t), i.e., the dipole moment densities, 
account for the response of matter to time-dependent fields, but now the polariza¬ 
tion and magnetization may depend on time. Recall from the static cases that P 
or M describes the combined effect of many atoms in a neighborhood of x. The 
response of any single atom to a time-dependent field is complicated and quantum 
mechanical, but summing over the huge number of atoms in even a small volume 
yields the simple classical polarization fields. 

11.3.1 ■ Free and Bound Charge and Current 

Turning first to Gauss’s Law (11.1) we know that when dielectrics are present the 
source of E will include bound charge Pb as well as free charge Pf. It is convenient 
to treat these sources separately, so we write 

pi, 

V • E = — = — (Pf + Pb) • 
f() e<) 


(11.32) 
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The field E(x, t) and the densities pf(x, t) and Pb( x > 0 in (11.32) may depend 
on time. The bound charge density pt, describes the effect of atomic dipoles as 
sources of electric field. It is given by Eq. (6.12) 

Pb = -V-P, (11.33) 

which is valid for fields with arbitrary time dependence because Gauss’s Law does 
not involve the time derivative. The displacement field is by definition (6.28) 

D = e 0 E + P, (11.34) 

so Gauss’s Law may be written as 

V D = p f . (11.35) 

Equation (11.35) is Gauss’s Law in the presence of dielectrics. This treatment 
of electric polarization for time-dependent fields has the same form as for static 
fields in Chapter 6. 

The second and third Maxwell equations, (11.2) and (11.3), are universal— 
unchanged in the presence of dielectric or magnetic materials. 

Finally we tum to the Ampere-Maxwell Law (11.7). Fundamentally, there are 
two source terms for B, proportional to J and 9E/9 t. To describe the interaction 
between matter and a magnetic field, we separate the current density J into free 
current, bound current, and polarization current 

J = Jf + Jb + Jp- (11.36) 

The bound current, an effective current density equivalent to the atomic magnetic 
dipoles from the magnetization, is defined by Eq. (9.12) 

Jb = V x M, (11.37) 

which we encountered when studying magnetic materials in Chapter 9. For time- 
dependent fields we must also include the polarization current, which does not 
appear in magnetostatics, defined by 

9P 

Jp=—• (11.38) 

at 

The polarization current is the current associated with change of electric polariza¬ 
tion. Polarization is the separation of positive and negative charge, bound together 
in the atom. A change of polarization occurs when positive charge shifts one way 
and negative charge another. That motion of charge is a current—the polarization 
current. 

Figure 11.3 explains the connection between P, pb , and Jp. Because matter is 
electrically neutral, bound charge and polarization current satisfy a charge con¬ 
servation equation. By their definitions (11.38) and (11.33) we have 
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FIGURE 11.3 The polarization current. Three cells inside a dielectric are shown, and 
in each cell a typical atom polarized by the presence of an electric field. (There are many 
atoms in a cell.) In the state (a), P(x) is constant and p(, is 0. In the state (b), P(x) has 
nonzero divergence (P increases to the right so V • P > 0) and Pb = — V ■ P is nonzero. The 
net charge in the center cell is negative because to go from state (a) to state (b) more positive 
charge passes out of cell through the right boundary than passes into the cell through the 
left boundary. If the material evolves in time from state (a) to state (b), then during die 
change there is a current Jp directed toward the right as positive charge moves to the right; 
that is the physical basis for (11.38). However, the individual charges remain bound in their 
atoms. The divergence of Jp is —dp\,/dt by conservation of charge. 


This result justifies the definition (11.38) for Jp: A variation of the bound charge 
is a current, and the current out through a closed surface must equal the rate 
of decrease of charge enclosed, because charge is conserved. Furthermore, it is 
straightforward to show from (11.39) that the total charge p = Pf + p\, and cur¬ 
rent (11.36) satisfy the full continuity equation (11.4), which is the fundamental 
equation. 

Now, when (11.36) is substituted into the Ampere-Maxwell Law, the field 
equation becomes 

/ ap\ 9E 

V X B = Mo I jf + v X M+ — 1 -Kioeo —, (11.40) 

from which, by rearranging terms, 

( B \ 3 

-M J = J f + — (e 0 E + P). (11.41) 

/to / 9r 


Or, finally we have 


3D 

V x H = Jf + —, 


( 11 . 42 ) 
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where H is the magnetic vector defined in (9.23) 

D 

H =-M. (11.43) 

Mo 

Equation (11.42) is the Ampere-Maxwell Law in materials. Jf is the free current 
density and 3D/3 t is the displacement current density in matter. 

The Maxwell equations in matter are (11.35) and (11.42), together with the 
universal equations (11.2) and (11.3), which are the same in matter or in vacuum. 
These are tabulated in the third column of Table 11.1. The sources pf and Jf are 
just the free charge and current. 

To use equations (11.35) and (11.42) we must know from some independent 
information how polarization and magnetization depend on E and B. In many ap¬ 
plications of electromagnetism in matter, the polarization is proportional to E and 
the magnetization is proportional to H, for time-dependent fields if the frequency 
isn’t too high, as well as for static fields. Materials with this property are called 
linear materials. The constants of proportionality, called the electric and magnetic 
susceptibilities Xe and x m , are conventionally defined by 

P = 6 0 x,E, (11.44) 

M = XmH. (11.45) 

For linear materials D is proportional to E, and B is proportional to H; that is 

D = eE, (11.46) 

B = p.H, (11.47) 

where the constant e is called the permittivity of the material, and p the per¬ 
meability. These relations specifying material properties are called constitutive 
equations. In terms of susceptibilites, e = eo(l + Xe) and p = po(l + Xm )• The 
constitutive equations are summarized in Table 11.2. 

The most important application of Maxwell’s equations in matter is to the the¬ 
ory of optics—the interaction between electromagnetic waves and matter—a sub¬ 
ject we shall study in detail in Chapter 13. 

11.3.2 ■ Boundary Conditions of Fields 

The Maxwell equations are partial differential equations, and whenever partial 
differential equations occur in physics, the question of boundary conditions arises. 


TABLE 11.2 Definitions and constitutive equations for linear materials 


Material 

Field definitions 

Susceptibilities 

Dielectrics 

D = 6qE + P = eE 

P = eoXeE 

Magnetic materials 

B = po(H + M) = pH 

M = XmH 
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What are the conditions that must hold at a boundary between different materials? 
Which fields are continuous? For field components that are discontinuous, what 
are the discontinuities? We can answer these questions by analyzing Maxwell’s 
equations at the interface between the materials. 

Let S be the surface separating two linear media, with material constants ei 
and ji\ on one side of S, and e 2 and /z 2 on the other side. We have previously 
determined the following boundary conditions for static fields: 


Di± — £>i_l = CTf (11.48) 

S 2 ±-S u =0 (11.49) 

E 211 —Em = 0 (11.50) 

H 2 || — Hi || = K f x n (11.51) 


where _L and || indicate vector components perpendicular and parallel to S, re¬ 
spectively, and n is normal in the direction 1 —2. The very same boundary 
conditions are true for time-dependent fields. 

Recall how these boundary conditions were derived. Equations (11.48) and 
(11.49) were derived by integrating the flux of D or B over a closed Gaussian 
surface that cuts through the interface—a pill box with height <5 that tends to 0 (see 
Fig. 11.4(a))—and applying Gauss’s theorem. By (11.35) the flux of D is <jfdA\ 
hence (11.48). By (11.2) the flux of B is 0; hence (11.49). The same analysis 
applies to the time-dependent Maxwell equations, and leads to the same boundary 
conditions, because the field equations (11.35) and (11.2) do not involve the time 
derivative. 

Equations (11.50) and (11.51) were derived for static fields by considering the 
circulation of E or H around a rectangular Amperian loop C cutting across S, 
and applying Stokes’s theorem. Figure 11.4(b) shows such a loop, with tangential 
and normal dimensions l and 8. In the time-dependent field equations there are 
additional terms involving the time derivative, but they do not contribute to the 



(a) (b) 


FIGURE 11.4 Boundary conditions of E and B. S is an interface between two regions, 
(a) A Gaussian pill box G of height 8 cutting through S. (b) An Amperian loop C of height 
S cutting through S. 
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discontinuity across S because the fields are nonsingular on S. For example, the 
circulation of E is —d^s/dt, where <$>u is the magnetic flux through a surface 
bounded by C. As the height 8 of C approaches 0, the flux —> 0 because 

the area £8 —*■ 0 while B remains finite; hence (11.50). Similarly the circulation 
of H is ju 0 (/ f + Id). As 8 —*■ 0, the displacement current Id 0 because again 
the area £8 -* 0 while D remains finite; the free charge current approaches the 
surface current integrated across the length £ of the loop C; hence (11.51). 

Where We Now Stand and How We Cot Here 

We’ve come quite far in our study of electromagnetism, so it’s interesting and 
useful to look back on our path. 

We now have Maxwell’s equations in matter, written in the right column of 
Table 11.1. Taken together with the linear constitutive equations in Table 11.2, 
Ohm’s law in the form J = crE, and the boundary conditions (11.48) to (11.51), 
we therefore have everything necessary to solve, in principle, the following very 
general problem in electrodynamics: What are the electric and magnetic fields 
for arbitrary time-dependent charge and current sources in the presence of linear 
dielectric, magnetic, and conducting materials? 

Recall that we started our work by first studying the equations V • E = p/eo 
and V x E = 0. With those PDEs, and boundary conditions, we could find E in 
vacuum for static charge sources. Second, we studied the equations V • D = Pf 
and V x E = 0. With those, and a constitutive equation and boundary conditions, 
we could find E and D in the presence of dielectrics and static sources. Third, 
we turned to magnetostatics, and with V ■ B = 0 and V x B = poJ we could 
find B in vacuum for static current sources. Fourth, we included magnetic materi¬ 
als, and found that with the equations V • B — 0 and V x H = Jf, together with a 
constitutive equation and boundary conditions, we could find B and H in the pres¬ 
ence of magnetic materials and static currents. With our work in this chapter, we 
are finally able to include arbitrary time dependence in our problems. Maxwell’s 
equations in vacuum, as written in the middle column in Table 11.1, together with 
boundary conditions, enable us to find E(x, t) and B(x, t ) for time-dependent 
sources p(x, t ) and J(x, t). 

In the interest of completeness, we must remark that there are more general 
problems than the one in italics above. These would include materials with non¬ 
linear electric and magnetic properties. In order to solve such problems we’d need 
more general constitutive equations, say of the form D = D(E), H = H(B), and 
J = J(E, B), or even more general relations. However, we will not consider such 
difficult problems. 


11.4 ■ ENERGY AND MOMENTUM OF ELECTROMAGNETIC FIELDS 

In general, electromagnetic fields possess both energy and momentum. The en¬ 
ergy and momentum flow through space as the fields change. In previous chapters 
we deduced the equations for energy density in static fields, namely ue = (oE 2 /2 
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and ub = B 2 /(2ho) for fields in vacuum. The purpose of this section is to de¬ 
rive the equations for energy flow and energy density of arbitrary time-dependent 
fields. Field momentum is also described, although its derivation is postponed 
until Chapter 12. As we shall see in Chapter 12, it is natural to study energy 
and momentum together because they are linked by relativity, as are time and 
space. 


11.4.1 ■ Poynting's Theorem 

What is energy ? Energy is a conserved quantity, defined by equations, associated 
with the ability to do work. A defining property of energy is that the total energy 
in an isolated system is constant. More precisely, if work is done by one part of 
the system on another part, then the first part loses energy equal to the work done 
and the second part gains an equal amount of energy. 

In the case of field energy, an even stronger statement is true: Energy is not 
just conserved overall, but it is locally conserved, i.e., point-by-point in space. To 
derive the correct equations for the energy density and energy flux, we require 
that energy be locally conserved. 

We studied in Chapter 7 another example of a local conservation law— 
conservation of electric charge. The equation for charge conservation is the 
continuity equation (11.4) where J is the flux and p is the density. RecalTfrom 
Sec. 7.2 that local conservation of charge implies that if the net charge within any 
volume changes, then there must be a corresponding current flow, i.e., a net flux of 
charge, across the boundary surface. Equation (11.4) expresses this conservation. 
We will derive the formulas for energy flux and energy density by demanding that 
they satisfy a conservation equation analogous to (11.4). 

For the case of energy, picture an arbitrary volume, either finite or infinitesimal, 
containing charges and fields as in Fig. 11.5. If the total energy inside this volume 
is to change then there must be a net energy flux across the boundary surface. 
It is necessary to account for both the field energy and the kinetic energy of the 
charged particles, because in general the fields and particles will exchange energy 
as they interact. That is, we seek an equation of the form 


3 u dure 

37 ~~ ~ 3 7 ' 


(11.52) 


where S is the field energy flux (energy flow per unit time per unit area), u is 
the field energy density (energy per unit volume), and uk is the particle kinetic 
energy density. (For simplicity we assume that the only forces on the particles 
are electromagnetic, so we do not need to account for other potential energies of 
the particles.) By Gauss’s theorem, equation (11.52) says that the rate that energy 
flows out through the surface bounding a small volume at x equals the rate that 
energy decreases inside the volume. 

If a particle with charge q moves, then the change of its kinetic energy K is 
equal to the work done on q ; that is, d K = F • \dt. This relation is called the 
work-kinetic energy theorem. The force is q(E + v x B) but the magnetic force 
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FIGURE 11.5 Poynting’s theorem. Many kinds of energy transfer occur in a system of 
fields and charged particles, e.g., an antenna or a plasma. As charges are accelerated by E 
the kinetic energy density ur changes. Also, the E and B fields change with time as the 
charges move, so the field energy density u changes. Waves may propagate away from the 
system with energy flux S = E x B//ro- Poynting’s theorem states that energy is conserved 
locally throughout the system. 


does no work, so 


dK 

dt 


= q E • v. 


(11.53) 


This result is for a single charged particle. To analyze a general continuous distri¬ 
bution of charge, with charge density p(x, t) and kinetic energy density uk(x, t), 
consider the total kinetic energy in an infinitesimal volume d 3 x at x. In this case 
the work-kinetic energy theorem is 

— d- 3 x = (pd\x ) E v, (11.54) 

dt 

completely analogous to (11.53), where v is the mean velocity of charges at x. 
Now, pv is the current density J(x, t). Thus the rate of change of particle kinetic 
energy per unit volume is 


duK 

dt 


= E J. 


(11.55) 


Using this formula for the kinetic energy term in (11.52) the continuity equation 
expressing conservation of energy is 


VS = - 


3 m 

a7 


-E J. 


(11.56) 
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This equation, together with the formulas for S(x, t) and m(x, t) derived below, is 
called Poynting’s theorem} 1 

Poynting determined the formulas for the energy flux S(x, t) and energy den¬ 
sity m(x, t) of electromagnetic fields; his results, which we shall verify, are 


S = — E X B, 

MO 

(11.57) 

u = € 1e 2 + —b 2 . 

2 2/r 0 

(11.58) 


The fields and S and u are all functions of position x and time t. In the case of the 
energy density w, (11.58) is the same as the equations we deduced previously for 
static fields; but here the derivation is more rigorous because it is based on local 
conservation of energy. The energy flux S(x, t ) is called the Poynting vector. The 
units of S are W/m 2 . The meaning of S is that if d\ is a small area in the field, 
then the energy passing through d\ in the time dt is (S • d\)dt. 

To verify (11.56) from Maxwell’s equations is an exercise in vector calculus. 
Start with E • J, and substitute for J, from the Ampere-Maxwell law, 

J=-VxB-e 0 ?. (11.59) 

Mo dt 

Now simplify E • J using various identities of vector calculus. For example, note 
that 


E • (V x B) = -V ■ (E x B) + B • (V x E) 
= -V (E x B) -B (9B/90- 


The first line is a vector identity, and the second follows from Faraday’s law. Also, 


9E 1 dE 2 

E • — =- 

dt 2 dt 


9B 1 dB 2 
and B • — = —. 

dt 2 dt 


Therefore, putting it all together, 

Ej =- v (i E * B KG s » £2+ i4 <n 


60) 


Poynting’s theorem (11.56) follows immediately, with S and u in (11.57) and 
(11.58). Poynting used the same argument to obtain these results in 1884. 

Equations (11.57) and (11.58) describe the field energy in vacuum. We used 
the vacuum form of Maxwell’s equations to derive (11.56). If the calculation is 
redone for fields in matter the appropriate formulas are 13 

12 J. H. Poynting was a professor of physics from 1880 until his death, at a school that became the 
University of Birmingham. Bom 1852 in Monton, Lancashire. Died 1914 in Birmingham. 

13 See Exercise 7. 
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S = E x H, (11.61) 

u = ±(E D + H B). (11.62) 

In this case, the energy transfer term 3 u% /3 1 is E • Jf, the work on the free charge 
only, because S and u include the internal energy of the bound charge and current 
as well as field energy. Strictly speaking, (11.61) and (11.62) are only correct for 
linear materials. 


EXAMPLE 3 Charging a capacitor, Part 2. In Chapter 3 we argued that the en¬ 
ergy of a capacitor belongs to the electric field. We can use Poynting’s theorem to 
see how the electric field energy builds up in the capacitor as it becomes charged. 

Recall the results of Example 1, and refer to Fig. 11.1. Again we use the 
quasistatic approximation, and neglect edge effects. On the cylindrical surface 
of radius a that encloses the volume between the plates, the electric field is 
(Q/na 2 e o)k and the magnetic field is (/iq!/ 2tza)4>. The energy flux density on 
the surface is 


S = -ExB= g/ r. (11.63) 

M o 2^ z a J eo 

Thus field energy flows radially into the cylinder as Q increases, i.e., when I > 0. 
The surface area is 2nad, where d is the distance between the plates, so the total 
rate at which field energy flows into the cylinder is 


Qld 
rza 2 e o 


(11.64) 


The total electric field energy Ue between the plates is the density €qE 2 /2 times 
the volume 7ia 2 d; 


U E 


Q 2 d 

2na 2 €o 


(11.65) 


Therefore dUE/dt = P, because dQ/dt = I by conservation of charge. Field 
energy flowing in through the surface builds up in the field between the plates. 
This justifies the earlier statement that the energy stored in a capacitor resides in 
the electric field. 

But what about magnetic field energy? While the capacitor is being charged, 
i.e., for I ^ 0, there is a magnetic field. The magnetic field energy between the 
plates is 


U M 



Z?2 

——2 Ttrdr d 
2yU0 


Mo I 2 d 
16 n 


( 11 . 66 ) 


where we have used the interior field B^,(r) derived in Example 1. However, this 
magnetic field energy is negligible compared to the electric field energy. The ra- 
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tio is 


U M 


a 2 1 2 

u E _Mof °8 e 2 ' 


(11.67) 


Because fioeo is 1/c 2 , where c is the speed of light, the ratio is of order a 2 /(cr) 2 , 
where r = Q/1 is a characteristic time of the charging process. A realistic sit¬ 
uation would have r » a/c, i.e., r much larger than the time for light to cross 
the radius of the capacitor, so Um is indeed negligible. This example is a good 
illustration of the quasistatic approximation. Because we used the quasistatic ap¬ 
proximation for the fields when we calculated P, we must neglect Um to be con¬ 
sistent, and this would be a good approximation in most real cases. On the other 
hand, if the time constant r is not large compared to a/c, e.g., for a very high fre¬ 
quency alternating current, then a significant amount of electromagnetic radiation 
occurs, energy propagates away from the system in electromagnetic waves, and 
the quasistatic approximation breaks down. 


EXAMPLE 4 Poynting vector for a current-carrying wire. Consider a segment 
of conducting wire that extends from z = 0 to z = i, as shown in Fig. 11.6. The 
power supplied to the moving charges in the dc current /, by the potential dif¬ 
ference Vo from one end of the segment to the other, is P = IV o. Although 
this power is ultimately supplied by a battery, or generator, which may be re¬ 
mote, it may be considered as transmitted by Poynting’s vector across the surface 
of the wire at r = a, in the following way: At the surface the magnetic field 
is B = (fj-ol/2na)(p. The electric field inside the wire and on the surface is 


z 



FIGURE 11.6 Example 4. There is a potential difference Vo and resulting current I 
across the wire segment of radius a and length i. The power flowing into the wire through 
the cylindrical surface, calculated from the Poynting vector, is equal to the power dissipated 
in resistance P = I Vq. 
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E = J/cr = (Vo/^)k- Thus at r — a the Poynting vector S is — (V^I /In a(.)r. 
Because the surface area across which S flows is 2 nal, the power inward through 
the surface is Vo I. 


In Examples 3 and 4 the Poynting vector is considered formally as transmitting 
power into a system. However, this viewpoint seems nonintuitive and abstract, 
especially in Example 4. Another, even more extreme example, is illustrated in 
Fig. 11.7, which shows a bar magnet and a point charge near the N pole. It is 
not difficult to see that the Poynting vector for this static system carries energy in 
closed circles in the <fi direction. However, this energy cannot be turned into heat, 
nor do work, nor be detected by any means! 

When we use the Poynting vector to analyze energy flow in electromagnetic 
waves, in Sec. 11.5, the interpretation will be clear and natural. 

11.4.2 ■ Field Momentum 

The electromagnetic field carries momentum as well as energy, and the momen¬ 
tum is also locally conserved. There is a continuity equation for momentum. But 
momentum is more complicated than energy. Energy is a scalar, so its fl ux S is 
a vector. But momentum is a vector, so its flux is a tensor of second rank. Here 
we are not going to derive the continuity equation for momentum conservation, 
which is a vector equation involving the divergence of the flux tensor, but we 
will examine the interesting result. The momentum density (momentum per unit 
volume, dP em /dV, a vector) of the electromagnetic field is 14 



FIGURE 11.7 Magnet and charge. A bar magnet and a point charge Q have static fields 
B and E, respectively. With these fields there is a time-independent Poynting vector S in 
the directions shown, i.e., tangent to circles around the symmetry axis. The energy flow 
associated with S in this case is merely formal; it has no physical significance because it 
cannot be detected. 

14 For a derivation see Griffiths, Chapter 8, or Jackson, Chapter 6. We’ll derive this result using rela¬ 
tivity in Chapter 12. 
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dP. 


= M o^oS. 


( 11 . 68 ) 


dV 


This result is consistent with the photon theory of electromagnetic waves. If the 
density of photons in a beam of light is n y , and each photon has energy E y 
and momentum p y , then the energy density is u = E y n y , the energy flux is 
S = E y n y c, and the momentum density is p y n y . Here c is the speed of light, or 
photons. The energy of a photon is E y = p y c, as for any massless particle in spe¬ 
cial relativity. 15 Thus the momentum density is S/c 2 , which agrees with (11.68) 
because mo^o = 1/c 2 . 

We will derive (11.68) in Chapter 12 using the relativistically covariant form of 
the theory. In relativity, energy and momentum are linked. The relativistic equa¬ 
tion combines conservation laws of both energy and momentum. 

There is also angular momentum in the electromagnetic field. The angular mo¬ 
mentum density (angular momentum per unit volume) is 


d Lem 
dV 


= r x 


dV 


= /rotor x S. 


(11.69) 


EXAMPLE 5 The Feynman disk paradox. In Fig. 11.8 an insulator disk is free 
to rotate on its axis. Attached to the disk coaxially there are: (1) a solenoidal coil 
that can be energized by a battery on the disk, and (2) a ring of positive charge 
fixed on the disk. Initially the battery is not connected to the coil, no current flows, 
and the system is at rest. 

Suppose the battery is now connected to the solenoid, by closing a switch, so 
that current flows in the solenoid in the direction shown. This causes a magnetic 


z 



FIGURE 11.8 The Feynman disk paradox. When the switch is closed there is an im¬ 
pulsive torque on the disk. Is angular momentum conserved? 

15 Recall the energy-momentum relation in special relativity, £ = yfp^cP^rn^A . jp (h e particle mass 
m is 0, then E = pc. 
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field in the +z direction through the coil. The changing flux that occurs while 
the current builds up to its final value produces an emf in the — 0 direction, by 
Lenz’s law, which produces an impulsive torque on the ring of charge in the — z 
direction. The disk begins to rotate. But the initial angular momentum was 0, and 
no external torques have been applied, so the final angular momentum should still 
be 0. How can this be? 

The resolution to this apparent paradox is that when current is flowing in the 
solenoid there is angular momentum in the electromagnetic field; its density is 
r x (E x B) eo by (11.69). This field angular momentum is equal but opposite to 
the mechanical angular momentum gained by the disk. Therefore the total angular 
momentum does remain 0. If the battery were now disconnected, by opening the 
switch, the magnetic field would decrease to 0 and there would be an emf in the 
+0 direction. The resulting impulsive torque on the ring of charge, which would 
be in the +z direction, would bring the system back to rest. 

The fact that an electromagnetic field—something without mass—can have 
momentum and even angular momentum seems surprising. But from the view¬ 
point of quantum electrodynamics it is quite natural. A photon carries energy and 
momentum like a massless particle. 


11.5 ■ ELECTROMAGNETIC WAVES IN VACUUM 

In this section we consider electric and magnetic fields without charges or cur¬ 
rents. 

The Maxwell equations in vacuum, without any matter at all, i.e., with p = 0 
and J = 0, are 


V • E = 0 (11.70) 

V x E = —dB/dt (11.71) 

V ■ B = 0 (11.72) 

V x B = /x 0 e 0 9E/9r. (11.73) 


Although no charge sources are present in the space being considered, fields may 
exist there in the form of electromagnetic waves. Charge somewhere else in the 
universe may have created the waves in the first place, but the waves continue to 
exist, and propagate through the vacuum with constant energy into places where 
no charge is present. 

There may also be static fields in a region where p = 0 and J = 0, produced 
by static charges and currents outside the region. But we are now interested in 
time-dependent fields, propagating through a vacuum, independent of any charge, 
so we ignore possible static field components. 

The purpose of this section is to find the mathematical properties of these 
wave solutions, and relate them to the physical properties of light and other 
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forms of electromagnetic radiation. In Chapters 13-15 we will consider the inter¬ 
action of electromagnetic waves with matter. 

11.5.1 ■ Derivation of the Wave Equation 

The wave equation for a quantity < p(x,t) that has linear wave motion is 

? 1 3 2 co 

= ° (11.74) 

where c is the wave speed. To see why c is the wave speed, consider propagation 
of plane waves in one dimension x, with (p independent of y and z. Then the 
equation is satisfied by any function of the form (p(x, t) = Fix — ct). If the 
function F(£) has a feature, e.g., an extremum or a node, at a certain value £o of 
its argument, then that feature in the solution Fix — ct) is located at x = £o + cf, 
the feature moves in the x direction with speed c. 

Equation (11.74) arises in many places in theoretical physics; for instance, <tp 
could be the pressure or density in sound, the displacement from equilibrium of a 
point in an elastic medium, or the electric or magnetic field in light. All of these 
are quantities that may be described by the wave equation. 16 

To derive (11.74) from Maxwell’s equations is rather straightforward. Use the 
“curl curl trick.” Consider V x (V x E). By the double cross product identity, it is 

V x (V x E) = V(V ■ E) - V 2 E - -V 2 E, (11.75) 

where the second equality follows from (11.70). But V x E is —3B/3? by Fara¬ 
day’s law, and making this replacement on the left-hand side of (11.75) we can 
also write 

V x (V x E) = V x (—3B/3r) = (V x B) = -ix 0 e 0 ~ (11.76) 

at at z 

where the final equality follows from (11.73), the displacement current. Compar¬ 
ing (11.75) and (11.76) we see that E(x, t) satisfies the wave equation 

o2 

V 2 E-M€ 0 -^2 -o. (11.77) 

The result is actually a vector equation. Each Cartesian component of E satisfies 
the wave equation of the form (11.74). By a similar calculation 17 B(x, t) satisfies 
the same equation 

o 3 2 B 

V“B -/xoeo-^y = 0. (11.78) 

16 We assume that the reader has encountered the wave equation and its solutions, at least in one 

dimension, in an introductory physics course. It might be advantageous to review the wave equation 

in an elementary textbook, such as those listed at the end of Chapter 1, or in one of the specialized 
books listed at the end of this chapter. 

17 See Exercise 8. 
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Equations (11.77) and (11.78) show that the fields obey the wave equation, 
but there is more to the story than that. What these equations do not show is 
the relationship between E and B in the wave. The waves in E and B are not 
independent, but must satisfy very strict conditions, which we will explore. For 
this reason, the waves are properly called electromagnetic. 

The speed of electromagnetic waves is, according to (11.77) and (11.78), 

c=—= ■ (11-79) 

vw0 

When Maxwell derived this result he knew the values of the electric and magnetic 
parameters eo and /M) from experimental measurements of electric and magnetic 
forces. He found that the speed c calculated from (11.79), is, within the exper¬ 
imental errors, the same as the measured speed of light. By this mathematical 
theory Maxwell discovered that light is an electromagnetic wave phenomenon. 

Today we use (11.79) to determine the value of eo. In SI units IM) is assigned the 
value 4 n x 10~ 7 N/A 2 , which defines the ampere (A) as we learned in Chapter 8. 
Then cq is 


eo 


1 

tlQC 2 


C 2 

= 8.85 X 1(T 12 -r. 

Nm 2 


The experimental verification of the full theory comes from a comparison of 
this €q calculated from the wave speed, and the value of eo measured from the 
Coulomb force between static charges. The two methods of evaluating eo give the 
same result. 

Maxwell’s theory of electromagnetic waves—solutions of the field equations— 
predicted the existence of forms of radiation that were unknown in his time, 
having wavelengths outside the range of visible light. These predictions were 
verified years later. For example, in 1887 Hertz discovered what we now call 
radio waves, which are long-wavelength electromagnetic waves. Also, in 1895 
Roentgen discovered X rays, which we now know are short-wavelength electro¬ 
magnetic waves. These, and all forms of electromagnetic radiation, travel with 
the same speed c in vacuum. 

The universality of the speed of light led to Einstein’s theory of relativity in 
1905. 

There exist electromagnetic waves with any wavelength, or frequency. The 
wavelength A and frequency v are related by Xv = c, a relation that holds for any 
kind of harmonic waves. Table 11.3 lists the different parts of the electromagnetic 
spectrum, and how each part of the spectrum is used in practical applications of 
electromagnetic waves. 


11.5.2 ■ An Example of a Plane Wave Solution 

Because the wave equation is linear, electromagnetic waves satisfy the Superposi¬ 
tion Principle: Any linear combination of solutions is also a solution. A complete 
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TABLE 11.3 The electromagnetic spectrum 


Frequency (Hz) 

Description 

Wavelength 

10 2 

super low frequency (SLF) radio waves 
submarine communication 

3000 km 

10 3 

ultra low frequency (ULF) radio waves 

300 km 

10 4 

very low frequency (VLF) radio waves 

30 km 

10 5 

low frequency (LF) radio waves 
marine radio 

3 km 

10 6 

medium frequency (MF) radio waves 

AM radio is 0.53 x 10 6 to 1.60 x 10 6 Hz. 

300 m 

10 7 

high frequency (HF) 
short-wave radio 

30 m 

10 8 

(VHF) aircraft radio and navigation 

FM radio is 0.87 x 10 8 to 1.08 x 10 8 Hz. 

TV channels 2-13 

3m 

10 9 

(UHF) cellular telephones, radar, 

microwave ovens, TV channels 14-83 

30 cm 

10 10 

(SHF) microwaves, radar, mobile radio 

3 cm 

10 11 

extremely high frequency (EHF) 

Cosmic microwave background 
maximum is at 3 x 10 11 Hz. 

3 mm 

10 12 

far infrared 

0.3 mm 

10 13 

far infrared 

30 txm 

10 14 

near infrared 

Visible light is 3.9 x 10 14 to 7.6 x 10 14 Hz. 

3/rm 

10 15 

near ultraviolet 

0.3 n m 

10 16 

vacuum ultraviolet 

30 nm 

10 17 

soft X rays 

3nm 

10 18 

soft X rays 

0.3 nm 

10 19 

hard X rays 

30pm 

10 20 

gamma rays 

3 pm 

10 21 

gamma rays 

0.3 pm 

10 22 

cosmic gamma rays 

30 fm 


set of solutions—complete in the sense that any solution can be written as a super¬ 
position of these solutions—is the set of plane waves. A plane wave is a solution 
of the wave equation with definite values of wavelength A and frequency v, related 
by c = Av. The wave fronts of a plane wave are infinite planes, perpendicular to 
the direction of propagation. 

In Sec. 11.5.3 following we’ll derive the general plane wave solution from 
the field equations. But before we go through the derivation it will be useful to 
examine a specific example of a plane wave, to understand its basic form. The 
reader will have to be patient to see where this form comes from; that will be 
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derived in the next section. For now let’s focus on understanding the geometrical 
characteristics of one specific example of an electromagnetic plane wave. 

The electric field for a plane wave traveling in the z direction, polarized in the 
x direction, is 


E(x,r) = E 0 e i(kz ~ at) \. (11.80) 

Eo is the amplitude, i is the polarization direction, the wavelength is A = 2 n/k, 
and the frequency is v = co/ln. The parameter k is called the wave number. The 
relation between co and k, called the dispersion relation, is 

co = ck, (11.81) 

which is equivalent to Av = c. Equation (11.80) shows that the wave propagates 
with velocity co/ k, which is called the phase velocity. Then by (11.81) the phase 
velocity of the plane wave is c. 

We’ve written the field at x and t as a complex number, but of course phys¬ 
ically E must be real. When we write (11.80), using the complex exponential 
function, we mean that the physical field is the real part of the right-hand side of 
the equation. The use of complex numbers, though it will take some getting used 
to, is tremendously convenient. 18 This will be explained more in the subsection 
following. 

There are several ways to see that (11.80) satisfies the wave equation. The 
simplest is just by direct substitution. V 2 acting on the exponential e ,kz becomes 
— k 2 , and 3 2 /3 1 2 acting on e~' wt becomes — oj 2 ; so the wave equation is satisfied if 
k — (d/c. Or, by a more roundabout argument, note that E(x, t) can be written as 

E(x, 0 = F(z - ct) i, 

where F(f ) = exp(ik^); the right-hand side has the familiar form of a one¬ 
dimensional wave, propagating in the z direction with speed c. The physical field 
E is Eo cos (kz — cot) i, the real part of (11.80). It oscillates in the ± i direction 
(the polarization direction) as the wave propagates in the k direction. 

Now, it is not enough that E(x, t) satisfies the wave equation. The wave equa¬ 
tion is necessary but not sufficient. E(x, t) and B(x, t) must together satisfy the 
four Maxwell equations. Therefore there must be a magnetic field as well, coupled 
with the electric field in ways specified by Maxwell’s equations. The magnetic 
field, which must be present with the electric field (11.80), is 

B(x, t) = Boe i{kz - at) ], (11.82) 


where 


B 0 = Eo/c. (11.83) 

18 “Something of the unreal is necessary to fecundate the real..Wallace Stevens in his preface to 
William Carlos Williams, Collected Poems (Objectivist Press, New York, 1934). 
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FIGURE 11.9 Electric and magnetic fields in an electromagnetic wave. The direction 
of propagation is k. The wave fronts are planes parallel to the xy plane. The figure is a 
snapshot of the fields along one line perpendicular to the xy plane. E oscillates in the ± i 
direction, and B in the ± j direction. Nodes of E and B coincide, as do the field maxima. 
E x B points in the direction of propagation. 


Again, we will derive this result in complete generality in the next section, but 
first let’s see what it means. Note that E and B oscillate in orthogonal directions 
(i and j, respectively) and the wave in both cases travels in the third orthogonal 
direction (k). That is, E, B, and k form an orthogonal triad of vectors. 

Figure 11.9 illustrates the geometrical form of the plane-wave solution (11.80) 
and (11.82). 

How to Treat Complex Waves 

The solution (11.80) and (11.82) is a special case of the general solution to be 
derived in the next section, but as an exercise the reader should verify specifically 
that these functions satisfy all four of the Maxwell equations. 19 

The solution (11.80) and (11.82) is called a complex wave, because E x and B y 
are complex numbers at any point of space and time. Recalling Euler’s identity, 
the complex exponential is 

e ,d = cos6 + / sin0, (11.84) 

where of course i = yf—\. The complex number e ,e has magnitude 1. Because 
e ,e is periodic in d with period 2n, the wave form e‘ (kz ^ ajt) is periodic in z with 

19 


See Exercise 9. 
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period A. = 2:x/k (the wavelength), and periodic in t with period r = 2n/(o (the 
reciprocal of the frequency). 

But what is meant by a complex field, physically? Physical fields are defined by 
forces and surely these must be measured in real numbers ! The use of the complex 
wave is just a mathematical convenience—a kind of trick that we employ because 
calculations with complex exponentials are easier than with sinusoidal functions. 
When we write equations (11.80) and (11.82) we mean that the real part of the 
complex wave is the physical field. (The real part of e ,e is cosh.) The real part 
is just understood. Because Maxwell’s equations are linear in the fields, both the 
real and imaginary parts of the complex wave satisfy the equations separately. 

The constant coefficients Eo and Bo may also be complex. If Eo = \Eo\e la 
then the physical field is 

Re = \Eo\cos{kz-cot +a)i. 

This shows that the phase angle a of the complex amplitude Eo is the phase shift 
in the physical wave. That is, a determines the location of crests, troughs, and 
nodes for given t. 

As long as we consider only expressions linear in the fields we may delay 
taking the real part. But before doing any calculation that is nonlinear in E and B, 
we must first take the real part, reducing the fields to real numbers. For example, 
the energy flux is E x B/mo- To calculate it we must first take the real parts of E 
and B. It is understood in the formula S = E x B//ro that E and B are the physical 
fields. 


EXAMPLE 6 What are the energy flux, energy density, and momentum density in 
the polarized plane wave (11.80) and (11.82)? 

The energy flux density S(x, t), i.e., the Poynting vector (11.57), is quadratic 
in the fields, so to calculate S for the plane wave we must first take the real part in 
(11.80) and (11.82). Then the result is 

E 2 

S = —— cos 2 (kz — <wt)k. (11.85) 

Mo c 

The direction of S is k, the direction of wave propagation. The wave intensity 1 is 
defined as the energy flux, in the direction of propagation, averaged in time over 
one period of oscillation. The units of J, like S, are W/m 2 . The average of cos 2 9 
over one period is because cos 2 9 oscillates between 0 and 1 symmetrically 
about j. Therefore the wave intensity is 


E 2 
_ £ o 

2/xo c 



I = k • S, 


2 


( 11 . 86 ) 
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The energy density u in the wave can be calculated from (11.58), which gives 

,172 b 2 

w(x, t) — -- cos 2 (kz — cot) + —^— cos 2 (kz — cot). (11.87) 

2 2 p 0 

The electric and magnetic contributions to u are equal in an electromagnetic wave, 
because Bq = Eq/c. The equality of Uf and um means E and B have equal 
importance in a free wave; the wave is truly “electromagnetic”! We may also 
write 


w(x, t) = € 0 E% cos 2 (kz - cot). 


( 11 . 88 ) 


Note by comparing (11.85) and (11.88) that S = uc. This little equation has a 
big meaning— energy conservation. Picture a cylinder with cross section A and 
length cdt. The wave energy in the volume A(cdt) will pass through the area 
A in time dt. That energy may be expressed either as uAcdt or as SAdt\ hence 
S — uc. It is also easy to verify Poynting’s theorem for this free electromagnetic 
wave; i.e., that V • S = —du/dt. 

The momentum density in the wave, calculated from (11.68), is 


dPem S W ~ 
~dV~ = c2 = c ' 


(11.89) 


where again u is the energy density. Equation (11.89) agrees with the photon 
theory of light; The energy-momentum relation for a photon is E = pc, as for 
any massless particle in special relativity. Therefore the momentum density in a 
beam of photons is equal to the energy density divided by c, in agreement with 
(11.89). 


Radiation Pressure 

Because an electromagnetic wave carries momentum, it exerts a pressure, called 
radiation pressure, when it hits a material surface. Suppose the material absorbs 
all the light incident on it. Then because momentum is conserved the material 
gains the momentum of the light. Recall from mechanics that dp = F dt is the 
change of momentum during dt due to a force F. Thus the pressure on an absorb¬ 
ing area A for normal incidence is Trad = E/A = ( dP)/(Adt), where dP is the 
wave momentum contained in the volume A(cdt). 20 By (11.89) dP is uAdt, so 
the radiation pressure is Trad — u. In terms of intensity 2, the average pressure on 
an absorbing surface is (Trad) = ( u ) = 2/c? x 

Radiation pressure is usually small (cf. the exercises) but in extreme cases 
it can be very large indeed. A dramatic example occurs in inertial confinement 

20 Please do not confuse V (pressure) with P or P (momentum). 

2 'The brackets (...) indicate the average of the enclosed quantity over a period of oscillation. For 
example, the intensity is I = ( S). 
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fusion experiments, such as are carried out at Lawrence Livermore National Lab¬ 
oratory, in which high-power lasers compress and heat small particles of matter. 


EXAMPLE 7 Consider a high-power laser, operating at a wavelength of 1.6 /um, 
which puts out 10 kJ in 0.2 ns. The beam is focused on an area of diameter 0.5 mm. 

First, what is the RMS electric field strength? The intensity (mean power per 
unit area) is I = 2.5 x 10 20 W/m 2 , which by (11.86) implies £nns = Eq/^/2 = 
3 x 10 11 V/m. Note that E nm is far above the breakdown field strength of air 
(3 MV/m at room temperature and 1 atm) and even above that of fused silica 
(10 3 MV/m). 

Second, what is the radiation pressure of the laser on an absorbing surface? 
We found above that (P r ad) = X/c, so CP ra d) = 8.5 x 10 11 Pa, which is about 
8.5 x 10 6 atmospheres. 


11.5.3 ■ Derivation of the General Plane Wave Solution 

The particular solution (11.80) and (11.82) examined in the previous section was 
just pulled out of a hat. Let us now derive mathematically the general harmonic 
plane wave solution. By a harmonic wave we mean a solution with a definite 
frequency v = a)/2rc. By a plane wave we mean that the wave fronts are planes. 
Therefore the fields, as complex waves, have the form 

E(x, r) = E 0 e ,(k * x_ "° (11.90) 

B(x, t) = B 0 e' (k ' x_ “ ,) , (11.91) 

where Eo and Bo are constant vectors. The real parts are understood to be the 
physical fields. These functional forms describe an electromagnetic wave prop¬ 
agating in the direction of k with phase velocity co/ k. The vector k, which de¬ 
termines the wavelength (A = 27 t/ k) and the direction of propagation, is called 
the wave vector. In general the wave vector may be in any direction, of the form 
k x \ ky j -I - kz k. 

In a vacuum, the Maxwell equations (11.70) and (11.72) require 

k • E 0 = 0, (11.92) 

k • B 0 = 0; (11.93) 

(The reason is that the x dependence of either E or B is in e' k ' x , and when V 
acts on the exponential it gives the exponential back again, times i k by the chain 
rule.) Thus an electromagnetic wave is a transverse wave: The fields oscillate in 
directions perpendicular to the direction of propagation. The Maxwell equations 
(11.71) and (11.73) require 


k x Eo = wBo, 
k x Bo = -MoeowEo; 


(11.94) 

(11.95) 
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FIGURE 11.10 The triad k, Eo and Bo. Note that k x Eo is in the direction of Bo, and 
k x Bo is in the direction of — Eo. 


because 3/3 1 acting on e~ ,a>t gives the exponential times —ico. Therefore Eo and 
Bo are perpendicular to each other, as well as perpendicular to k. Figure 11.10 
shows the directions of k, Eo and Bo implied by (11.94) and (11.95). The three 
vectors Eo, Bo, and k form an orthogonal triad of vectors, with Eo x Bo parallel 
to k. The Poynting vector is 

1 ^ 

S =—Eo x Bo cos 2 (k • x — cot) (11.96) 

MO 

so the energy flux is in the direction of the wave vector k. 

The wave is transverse. Each field oscillates, in either space or time, in a 
direction perpendicular to the direction of propagation. An infinite number of 
transverse directions exist, but any transverse vector can be written as a linear 
combination of just two basis vectors. We say that plane waves with wave vec¬ 
tor k have two linearly independent polarizations. 22 For example, if k is in the z 
direction, then i and j can be taken as the basis vectors for the transverse oscil¬ 
lations. Any plane wave is a superposition of the two basis polarizations. We will 
study polarization effects in optics in Chapter 13. 

So far we have identified the directions of the vector fields. Now we turn to 
their magnitudes, and the wave speed. Since Eo, Bo, and k form an orthogonal 
triad, the cross products in (11.94) and (11.95) are simple. In (11.94) the magni¬ 
tude of k x Eo is just kE o, and the direction is parallel to Bo, so (11.94) implies 


Eo co 
~Bq ~ T 


(11.97) 


In (11.95) similarly |k x Bol = kBo and k x Bo is parallel to —Eo, so (11.95) 
implies 


Eo = k 
B 0 


(11.98) 


22 


See Exercise 19. 
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Equating the two ratios shows that 


ru 2 1 
k 2 e 0 M0 ’ 


(11.99) 


This relation is required by Maxwell’s equations. Now, co/kisc by the dispersion 
relation co = ck. So the wave speed is 1 /JJL o?o, as found before in (11.79). Then 
by (11.97) the ratio of amplitudes must be Eo/ Bq = c, as stated earlier in (11.83). 
Summarizing, for any harmonic plane wave, 


a) 1 E o 

- =c= -- and B 0 =—. (11.100) 

k c 

Equations (11.90) and (11.91) are the most general linearly polarized harmonic 
plane wave. Any linear combination of plane waves is also a solution of Maxwell’s 
equations, by the superposition principle. The converse is also true: A remarkable 
theorem of Fourier analysis states that any solution of the Maxwell equations 
in vacuum can be written as a superposition of plane waves. Common electro¬ 
magnetic waves, such as light from the sun or from an electric bulb, are unpo¬ 
larized and not monochromatic. Such waves are complicated superpositions of 
plane waves. For example, sunlight is white light—a superposition of all visi¬ 
ble wavelengths (and also infrared and ultraviolet wavelengths) with no preferred 
direction of polarization. The electric field direction at any particular point in sun¬ 
light is constantly fluctuating this way and that, on a very short time scale, because 
there is no preferred polarization direction. An approximation of a linearly polar¬ 
ized harmonic wave could be produced by passing sunlight through a polarizer 
and a color filter, to “pick out” the component with the specified polarization and 
frequency. 

Electromagnetic waves propagate through empty space. Neither matter nor 
charge is needed to maintain an electromagnetic wave. The wave carries its own 
energy and momentum, which are, of course, conserved. Electromagnetic waves 
may travel immense distances, e.g., the light arriving at Earth from distant galax¬ 
ies. These waves of electromagnetic field are a part of the physical universe. 

But field theory is a local theory, specified by differential equations. How are 
the electromagnetic fields at a particular point produced? What is the source of 
the fields at that point, if no charge is present? 

How do electromagnetic waves work? Each field is produced by the variation in 
time of the other field, in a self-generating process: E is produced by the variation 
of B, by Faraday’s law; B is produced by the variation of E, by the displacement 
current. By propagating through space as a transverse wave, the fields satisfy all 
four Maxwell equations at every point of space and time, self-consistently. The 
electromagnetic wave is a beautiful example of an extended geometric structure 
created by local mathematical rules. 

The applications of electromagnetic waves are many, including all of optics. 
Applications usually involve interactions between waves and matter, or the gen¬ 
eration of waves. These topics are covered in Chapters 13 to 15. 
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11.5.4 ■ A Spherical Harmonic Wave 

A plane wave is very much a mathematical idealization. The wave fronts of the 
wave (11.90) and (11.91) are infinite planes perpendicular to k. That is, the fields 
E(x, t) and B(x, t) are uniform over any infinite plane perpendicular to k. The 
source of an ideal plane wave would have to be coherent over an infinite area. In 
contrast, a real physical wave must have finite extent. 

In this section we’ll construct another solution of the Maxwell equations in 
vacuum, that has the form of a spherical wave —a harmonic wave with spherical 
wave fronts propagating outward from the origin. Here we study only the fields in 
vacuum, so we are not yet in a position to characterize the source. The connection 
between waves and their sources is the subject of Chapter 15. For now it’s enough 
to picture the spherical wave as emerging from a point source and propagating 
with equal speed in all directions. 

An analogous example in two dimensions, familiar to everyone, is the circular 
wave on a water surface made by dropping a pebble into a pond. The circular 
wave fronts propagate isotropically away from the source point. However, in our 
example the electromagnetic wave is produced continuously, whereas the pebble 
makes a limited wave train. A better analogy would be the circular waves created 
in a ripple tank by a pointlike object oscillating up and down on the water surface. 

To guarantee that all four Maxwell equations are satisfied, we shall construct 
the solution from a vector potential. The Cartesian components of A(x, t) satisfy 
the wave equation (11.74), and it is sufficient to let A be everywhere in the z 
direction, A = A z k. Then a solution in spherical coordinates is 23 

A(x, t) = 'J'(r, f)k (11.101) 

where k is the unit vector in the z direction, and 

4/(r, t) = -e i(kr - a>,) . (11.102) 

r 

The function T (r, t ) is a spherically symmetric scalar solution of the wave equa¬ 
tion (11.74). Using spherical coordinates to calculate the Laplacian V 2 , it is 
straightforward to show that V 2 4 / = —k 2 ^; also, d 2 ^/dt 2 = —w 2 ^. The wave 
equation is satisfied for co = ck. The potential A(x, t) is singular at the origin, 
i.e., at the source of the wave, because T —► oc as r —► 0. We will not be 
able to determine the nature of the source until Chapter 15, but we can solve for 
the electromagnetic wave propagating away from the origin. It turns out that the 
source is a point-like electric dipole, oscillating in the z direction. For example, it 
could be a small charge undergoing simple harmonic motion on the z axis. 

Now, the magnetic field is B = V x A. It is most convenient to use spherical 
coordinates to calculate the curl, and to express the fields in spherical components. 
Writing (11.101) in spherical coordinates, the vector potential is 


23 See Exercise 26. 



11.5 


Electromagnetic Waves in Vacuum 


435 



FIGURE 11.11 The asymptotic spherical wave. At any time and position the magnetic 
field is azimuthal—parallel to a line of latitude; and the electric field is orthogonal to B— 
parallel to a line of longitude. At the time and radius shown, B and E are in the +<p and 
+6 directions, respectively. One half period later at this radius B and E will be in the — <f> 
and —6 directions. The Poynting vector points radially outward. 


A(x, t) = ^(r, f)(?cos# — 0sin0) = rA r + OAq. (11.103) 

Its curl is 24 


d> 

B(x, t) = - 
r 


( rAg )-^ = 4> (-ik + sin0 'I'fr, t). (11.104) 


The wave propagates in the ? direction, because of the factor e‘ <kr "0 j n ;). 
The magnetic field at any point is in the azimuthal direction (f) as illustrated in 
Fig. 11.11. This direction is consistent with the fact that the wave comes from an 
electric dipole, oscillating along the z axis, because B curls around the current. 

One approach to determining the electric field might be to construct the scalar 
potential V(r, t), consistent with the Lorentz gauge choice. However, that is not 
necessary. It is easier to find E(x, t) directly from B(x, t ), using the Ampere- 
Maxwell equation (11.73). E(x, t) must have the same frequency as B(x, t), i.e., 
E(x, t) has the form E(x)e~"' ,; . Then the time derivative of E is equal to — icoE, 
and (11.73) implies 

c 2 V x B 

E(x, t) = - : -. (11.105) 

—ICO 


To calculate the curl of B is somewhat tedious, but the final result is not too 
complicated: 


24 See Exercise 27. 
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E(x, t) = rE r + OE 0 , (11.106) 

where 

E r = ^7—(— —cos6> ^(r, t) (11.107) 

ico \ r r L ) 

E e = — (k 2 + — - -41 sine 4/( r , t). (11.108) 

ico \ r r L / 

The electric field is everywhere orthogonal to the magnetic field. 

It can be shown by direct substitution that B(x, t) and E(x, t) satisfy all four 
Maxwell equations in free space (for r > 0). But that rather long calculation is not 
really necessary because we can verify the equations by general considerations. 
B is a curl, so V • B = 0. E is also a curl by (11.105) so V • E = 0. Equation 
(11.105) was based on (11.73) so the latter equation is satisfied. All that remains 
is to check (11.71), and that is guaranteed by the fact that B satisfies the wave 
equation, because (r, t) does. 

This spherical wave has a complicated dependence on r. For small r, i.e., r <$C 
A, the dominant field is the electric field, which is of order r -3 ; that region is 
called the near zone, and the form of E is determined by proximity to the point 
source. For intermediate r, i.e., r ~ A, the fields do not have a simple power law 

dependence on r. For large r, i.e., r » A, both fields are of order r -1 ; that region 

is called the radiation zone. The asymptotic fields for large r are 

B asy (x, t) = —ik sin# ^(r, t)4>, (11.109) 

E asy (x, t ) = — icusin# 4>(r, t)0. (11.110) 

(The real part of the right-hand side is understood.) These radiation fields, together 
with the direction of propagation r, form an orthogonal triad at every point in 
the radiation zone, with E a sy x B asy in the direction of ?. The magnetic field is 
azimuthal, i.e., parallel to lines of latitude, as shown in Fig. 11.11. The electric 
field is parallel to the lines of longitude, also shown in the figure. The Poynting 
vector is radially outward, so the wave propagating radially is carrying energy 
outward. 

Note that the asymptotic fields are proportional to r~\ and so the energy flux 
per unit area is proportional to r~ 2 . Therefore energy is conserved in the wave: 
The integrated power through a sphere of radius r is independent of r because 
the area grows as 4 nr 2 . The average power flowing out through a small spherical 
surface enclosing the source is the same as the average power through any larger 
sphere. Of course energy is being fed into the wave by the source at r = 0, which 
is hidden from us because we have only used the field equations in the vacuum, 
outside the source. In Chapter 15 we will see that (11.104) and (11.106) are the 
fields of an oscillating electric dipole at the origin. Some external agent must be 
supplying energy to make the dipole oscillate. 
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The asymptotic fields (11.109) and (11.110) resemble a plane wave in these 
respects: (i) The spherical wave fronts are approximately planar, (ii) The fields 
and propagation direction form an orthogonal triad of vectors, (iii) The ratio 
|B asy |/|E asy | is 1/c. A plane wave is a mathematical idealization, which approxi¬ 
mates a physical wave far from its source. 

The reason plane wave solutions are so important is because they are mathe¬ 
matically complete ; any solution of the wave equation can be written as a superpo¬ 
sition of plane waves. Even the spherical wave is equal to a certain superposition 
of plane waves in all directions. 

11.5.5 ■ The Theory of Light 

In the late 17th century a dispute arose over the nature of light, between Newton 
who argued for a particle theory of light, and Huygens who favored a wave theory 
of light. 

At the beginning of the the 19th century, Young observed interference of light, 
and so showed that light behaves as a wave. Fresnel explained that the optical phe¬ 
nomena of light polarization imply that light is a transverse wave. Later Maxwell 
developed the mathematical theory of the electromagnetic field, and found that 
solutions of the field equations describe the properties of light as an electromag¬ 
netic wave. These concepts were further developed by a group of British and Irish 
physicists, including Heaviside, FitzGerald, and Fodge, who called themselves 
the Maxwellians. 

However, Maxwell’s theory is not the end of the history of the science of light. 
Early in the 20th century, Planck and Einstein showed that light energy is quan¬ 
tized. Therefore light also behaves as if composed of particles, which we call 
photons. Compton did definitive experiments on X-ray scattering that verified the 
predictions of the photon theory. Dirac formulated a quantum field theory of elec¬ 
tromagnetism. This quantum theory of light does not negate Maxwell’s theory 
of electromagnetism. Indeed, Maxwell’s equations are also true in quantum elec¬ 
trodynamics (QED), although their interpretation is different from the classical 
theory because in QED the fields have quantum uncertainty. 

Classical electrodynamics—Maxwell’s theory—is a valid description of elec¬ 
tromagnetic phenomena, including light, in the limit of large field intensities. A 
large field intensity corresponds to large numbers of photons, so that the quantiza¬ 
tion unit (one photon) is small on the scale of the field. The classical theory is the 
limit of QED in the same sense that classical mechanics is the limit of quantum 
mechanics for large mechanical systems. The mean value of the quantum field 
obeys the classical field equations, and if the field intensity is large then the ef¬ 
fects of uncertainty are negligible. The classical theory does break down for very 
small systems, such as a single photon or atom. But the classical theory remains 
important despite this limitation, for the many phenomena that occur on scales 
much larger than a single atom. 
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FURTHER READING 

The following books discuss general properties of the wave equation, in a variety 
of applications. 

F. S. Crawford, Jr., Waves (McGraw-Hill, New York, 1968). 

A. P. French, Vibrations and Waves (Norton, New York, 1971). 

W. C. Elmore and M. A. Heald, Physics of Waves (Dover, New York, 1969). 

H. Georgi, The Physics of Waves (Prentice-Hall, Englewood Cliffs, NJ, 1993). 

The interesting history of the electromagnetic theory of light is recounted in J. 

G. O’Hara and W. Pricha, Hertz and the Maxwellians (Peter Peregrinus, London, 
1987). 

The Feynman Lectures on Physics, Vol II, contain interesting insights on electro¬ 
magnetic waves, especially in Chapters 18 and 21. 


EXERCISES 

Sec. 11.1. The Maxwell Equations in Vacuum and Displacement Current 

11.1. In 1929 at the Free University of Brussels, M. R. Van Cauwenberghe measured, as 
directly as was then possible, the magnetic field caused by the displacement cur¬ 
rent between the plates of a parallel-plate capacitor. In his system the plates were 
specially shaped, but they were approximately parallel disks of diameter 1.5 m sep¬ 
arated by 0.4 m. He applied across the plates an oscillating voltage, with amplitude 
174kV and frequency 50 Hz. For this system, with air between the plates, calculate 
the value of B at radial distance 0.4 m from the center. 

To measure the small field, Van Cauwenberghe devised a magnetometer that 
consisted of a toroidal solenoid with an iron core, suspended parallel to the plates 
in the midplane of the capacitor. By using an iron core he increased the sensitiv¬ 
ity by a large factor, probably more than 10 3 . Also, the solenoid had 813 turns, 
which increased the sensitivity by another factor of almost 10 3 . Ultimately he mea¬ 
sured induced emfs in the solenoid of order 0.1 mV. His measurement agreed with 
Maxwell’s theory of the magnetic field induced by the displacement current in the 
capacitor. (Reference: Van Cauwenberghe, Journal de Physique et le Radium, 10, 
303 (1929).) 

11.2. A capacitor with circular parallel plates, with radius a and separation d, has poten¬ 
tial difference V(t). 

(a) Determine the magnetic field on the midplane of the capacitor, at radius r from 
the symmetry axis, for r > a. [Answer: B(r) = (^CQa 2 /2rd) dV/dt] 

(b) Show that B is the same as the field of a straight wire carrying current / = 
dQ/dt, where Q is the charge on the capacitor. 

11.3. Show that the discontinuity of B across a capacitor plate as the capacitor is being 
charged with current / is equal to //qK x n, where K is 
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By the general boundary condition on the tangential field, K is the surface current 
density that flows radially as the plate becomes charged. Show that this functional 
form of the surface current implies that the charge remains uniformly distributed 
over the area of the plate. (Hint: Show that the rate of change of charge in an 
arbitrary annulus is proportional to the area of the annulus. Therefore the charge 
divided by the area is constant.) 


Sec. 11.2. Scalar and Vector Potentials 

11.4. Derive the equation (11.31) for A(x, t) in the Lorentz gauge. 

11.5. (a) Show that the general equations for the potentials V (x, t) and A(x, t), i.e., 

without any gauge condition, are 

V 2 V + (V ■ A) — — — 

3 1 e 0 

, 3 2 A / dV\ 

VA - n 0 e 0 -^~ -VI V-A + /x 0 co— I = -MoJ- 

(b) The equations in a particular gauge can be obtained from the general ones by 
substituting the gauge condition. Derive, by this method, the field equations in 
the Coulomb gauge and the Lorentz gauge. 


Sec. 11.3. Maxwell’s Equations in Matter 

11.6. (a) Show that the Maxwell equations in matter reduce to the Maxwell equations in 
vacuum if the polarization and magnetization are zero. 

(b) Show that the Maxwell equations in vacuum are the same as the Maxwell equa¬ 
tions in a uniform linear material with e = eq and M = /zo- 


Sec. 11.4. Energy and Momentum of the Electromagnetic Field 

11.7. Consider the electromagnetic field in a linear medium with material properties e 
and /z. Calculate V • S and du/dt for the energy flux S and density u given in 
(11.61) and (11.62). Identify the energy transfer rate dux/dt in (11.52) such that 
the energy continuity equation is satisfied, and interpret the result. 


Sec. 11.5. Electromagnetic Waves in Vacuum 

11.8. Use Maxwell’s equations (11.70) through (11.73) to show that the magnetic field 
B(x, t) satisfies the vector wave equation. 

11.9. Verify explicitly that the specific plane wave solution (11.80) and (11.82) satisfies 
the four Maxwell equations. 

11.10. (a) Determine the vector potential A(x, t) and scalar potential V(x. t ) in the 
Lorentz gauge, for the linearly polarized plane wave described by (11.80) 
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and (11.82), with the boundary condition that the potentials must be finite at 
infinity. (Hint: Let V = 0.) 

(b) Consider now the general linearly polarized plane wave, with wave vector k, 
polarization direction n, and frequency co = c|k|. Determine the vector and 
scalar potentials for this case. 

(c) Suppose the boundary condition in (a) is removed. Show that A(x, t) = 
— Boxe l ^ z ~ wt ^V. and V(x, t) = —cBoxe l ^ liz ~ wt ^ satisfy the Lorentz gauge 
condition and give the fields in (11.80) and (11.82). 

11.11. Calculate V • S and du/dt for a linearly polarized plane wave propagating in the z 
direction and polarized in the x direction. Explain the meaning of the results. Show 
that the electric field and magnetic field have equal energy densities in the plane 
wave. 

11.12. For a wave that depends only on z and t, the wave equation (11.74) becomes 

= 0 

3z 2 c 2 3r 2 

Show that a general solution of this equation may be written as <p(z , t ) = f(z — 
ct) + g(z. +cf), where / and g are arbitrary functions of their arguments. Show by 
substitution in the equation that <p(z, t) = C cos kz cos kct is a solution of the wave 
equation. Then determine / and g for this solution. Explain what is meant by the 
statement that a standing wave is the superposition of traveling waves in opposite 
directions. 

11.13. Consider the electromagnetic field 

E(x, y, t) = Eqcos(tzx/ L) cos(ny /L) sin cut k 

B(x, y, t) = Bq T— cos(7 xx/ L) sin(7r y /L) i + sxninx/ L) cos(ny / L) j J cos cot. 

(a) Show that this field satisfies the Maxwell equations in vacuum if co = \/2izc/L 
and Bq = E 0 /(V2c). 

(b) This field represents a standing electromagnetic wave inside a box with metal 
walls and a square cross section of size Lx L parallel to the xy plane, and very 
long in the z direction. This is an example of a cavity oscillator. Sketch the E 
and B fields. Note that the wave number k is jt/ L, so the wavelength is 2 L. 

11.14. A field meter shows that the amplitude of the electric field oscillation in a certain 
radio wave is 5 millivolts per meter. 

(a) What is the amplitude of the magnetic field oscillation, in T? 

(b) What is the intensity in W/m 2 ? 

11.15. Polarized light is incident on a perfect polarizer, and it is observed that 20% of the 
light intensity gets through. What is the angle between the polarizer axis and the 
polarization direction of the light? [Answer: 63 degrees] 

11.16. A polarized plane electromagnetic wave moves in the y direction, with the electric 
field in the ±.v direction. What is the direction of the magnetic field at a point where 
the electric field is in the — x direction? 
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11.17. Suppose the electric field in an electromagnetic wave is 


E(x, t) = —JL sin(fcy 


■ cot). 


(a) Determine the magnetic field. (Hint: V x E = — 3B/3r.) 

(b) Determine the Poynting vector. 

11.18. Circularly polarized waves. Consider a superposition of waves traveling in the z 
direction, with fields 


E(x,f) =Re{i£ 1 f- , ' (iz- " f) + j £ 2 e !( * Z- " f) } 

B(x, 0 = Re ( j - i El e i(kz-<ot) 

[ c c 


where E\ and E 2 may be complex 

E] = C k ! >i , E 2 = C 2 e i ^. 


(C 1 and C 2 are real.) 

(a) Calculate the average energy flux S a vg- 

(b) Suppose E\ = C and E 2 = iC, i.e., C\ = C 2 = C and <p\ = 0, <j> 2 = jt/ 2. 
Determine the direction of E as a function of t, at a point on the xy plane. 
Describe the result in words and pictures. 

(c) For the same field as (b), determine the direction of E as a function of z, for a 
snapshot of the field at t = 0. Describe the result in words and pictures. 

11.19. Consider the fields 

E(x, t) = i F] (x — ct) + j F 2 (x — ct) + kF 3 (.r — ct) 

B(x, t) = - j^i G] (x — ct) + j G 2 (x — ct) + kG 3 (* — cr)J 

where the functions F \, F 2 ,..., G 3 approach 0 in the limits x —► ± 00 . These 
fields obviously satisfy the vector wave equation. They correspond to a pulse of 
radiation moving in the +x direction. But the Maxwell equations place severe re¬ 
strictions on the components Fj, F 2 , ..., G 3 . 

(a) Show that the Maxwell equations require F] = Gj = 0, G 3 = F 2 , and 
G 2 = — F 3 . (Thus there are only two independent polarizations.) 

(b) Suppose F 2 (t;) = G 3 (£) = Fo exp(—£ /a ) and the other components are 0. 
(£ stands for x — ct.) Make a sketch that shows a snapshot of the fields in space 
at time t. 

11.20. (a) For an ideal plane electromagnetic wave in vacuum, show that the energy 

crossing an area A in 1 second, with A normal to the direction of propaga¬ 
tion, is equal to the total field energy within a cylinder with base A and height 
3 x 10 8 m. 

(b) Show that 1 = cu av and explain this result in terms of conservation of energy. 
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11.21. (a) The intensity of sunlight at the Earth is X so j ar = 1300 W/m 2 . Compute the 

RMS electric and magnetic field strengths (in V/m and T, respectively) for a 
plane wave with intensity 2 S0 ] ar . Does the result depend on the wavelength of 
the plane wave? 

(b) Estimate the RMS electric field strength in the light from a 100 W light bulb, 
at 1 m distance from the bulb. 

(c) The total power of light from a small He-Ne laser pointer is 0.1 mW, in a beam 
of diameter 4 mm. Calculate the RMS electric field strength. 

(d) The wavelength of light from a He-Ne laser is 633 nm. How many photons 
does the laser pointer emit in one second? 

11.22. Consider a radio antenna radiating 20 kW of power in radio waves with frequency 
100 MHz. Estimate the order of magnitude of the RMS electric field strength at a 
distance of 10 km, by assuming the radiation is uniformly distributed in a hemi¬ 
sphere around the transmitter. (More precisely, the intensity would depend some¬ 
what on direction.) 

11.23. Calculate the force due to radiation pressure by sunlight falling in the normal di¬ 
rection on a 1 cm 2 light-absorbing surface. The intensity of sunlight is 1.3kW/m 2 . 
What is the force if the surface reflects light? (Hint: F dt = dp.) 

11.24. The dust tail of a comet points away from the sun because of radiation pressure by 
sunlight. Estimate the order of magnitude of the force on a dust grain with linear 
dimension 1 /tm at the radius of the Earth’s orbit, where the intensity of sunlight is 
1300 W/m 2 . Compare the radiation force to the force of solar gravity on the grain, 
assuming the grain has density 5 g/cm 3 . Can the radiation affect significantly the 
orbit of the dust grain? 

11.25. Why does light exert a pressure on a metal surface? By considering the directions 
of E, B, J, and F, show that there is a force on the metal, due to the magnetic force 
on the electrons, from light incident normal to the surface. The result is actually 
somewhat tricky. J is parallel to E, and in phase with the oscillation of E (that's 
important!) because of resistance, which makes J = ctE. 

The Spherical Wave 

11.26. Show that A(x, t) of (11.101) and (11.102) is a solution of the wave equation. 

11.27. Carry out the calculations of B and E for the spherical wave in Section 11.5.4. 
First calculate B = V x A, where A is given by (11.101), using spherical polar 
coordinates. Then calculate E from (11.105). 

The asymptotic vector potential at large r is proportional to r -1 , so one might 
guess that the asymptotic fields, involving derivatives of A, would be of order r -2 . 
But the asymptotic fields are of order r~ , a very important result because it implies 
there is an outgoing flux of energy. Where do the terms of order r -1 come from? 

11.28. The wave fronts of the spherical wave in Sec. 11.5.4 are spheres. However, the 
energy flux is not isotropic. 

(a) In the radiation zone show that the differential power dP/dO., i.e., the average 
power per unit solid angle, is proportional to sin 2 0, where 0 is the polar angle. 
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(Hint: dP/dQ is r 2 S av ■ r; calculate that differential power in the radiation 
zone from the asymptotic fields.) 

(b) Calculate the fraction of the total power within ±10 degrees of the equato¬ 
rial plane. Compare your result to the fractional solid angle of that range of 
directions (i.e., its fraction of the total solid angle 4 jt). 

11.29. Show that for the spherical wave in Sec. 11.5.4, the asymptotic electric field as 
r —> 0, i.e., the near field, is an electric dipole field. (The dipole field is given in 
(3.101).) 

General Exercises 

11.30. Most of the electromagnetic energy in the Universe is in the cosmic microwave 
background radiation, a remnant of the Big Bang. This radiation was discovered 
by A. Penzias and R. Wilson in 1965, by observations with a radio telescope. The 
radiation is electromagnetic waves with wavelengths around 1.1 mm. The energy 
density is 4.0 x 10 -14 J/m 3 . (This is 2.5 x 10 5 eV/m 3 , half the rest energy of an 
electron in each cubic meter of the Universe.) 

(a) What is the RMS electric field strength of the cosmic microwave background 
radiation? [Answer: 0.067 V/m] 

(b) How far from a 1000 W transmitter would you have to go to have the same 
field strength? Assume the power from the transmitter is isotropic. [Answer: 
2.6 km] 

11.31. What are the (real) electric and magnetic fields for a monochromatic plane wave 
whose polarization is parallel to the xy plane, and which is traveling in vacuum in 
the direction from the origin to the point (0, 1, 1)? Express your answers for both 
fields in terms of the amplitude Eo of the electric field oscillations, the frequency 
a), and the speed of light c. Use Cartesian coordinates and basis vectors. 

11.32. Table 11.3 shows a wide range of frequencies and wavelengths for electromagnetic 
radiation. The relation Xv = c holds over all time and length scales. 

(a) What is v if X is the Earth-moon distance? 

(b) What is v if X = 2 fm. the diameter of a He nucleus? What is the energy in eV 
for a photon of that frequency? 

11.33. In 1991 the Fly’s Eye detector in Utah observed a cosmic ray with energy 320 EeV 
(1 EeV=10 18 eV). The identity of the primary particle is unknown, but it might 
have been a gamma ray. What is the photon wavelength for this energy? How does 
it compare to the size of a proton? 

Computer Exercise 

11.34. Consider an electromagnetic wave with vector potential A(x, t) = j f(x — ct). 
(The scalar potential is 0.) The function f(x — ct) approaches 0 as x -*■ ±oo, 
so the electromagnetic field is a wave packet. Suppose the wave hits an electron 
(charge —e, mass m) initially at rest at the origin. 

(a) Derive the equations of motion for the electron velocity components v x , 

Vy, V Z . 
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(b) Show that u z (!) = 0. 

(c) Show that v y (t) = (e/m)f(x — ct), where x is the x position of the electron 
at time t. 

(d) Show that 

(e) Describe in words and pictures the trajectory of the electron, assuming the 
wave packet has a short length. In particular, show that the electron will have a 
positive displacement in the x direction. 

(f) Assume v } t « c and /(£) = K exp(— % 2 /d 2 ). Use a computer to solve the 
differential equation for x(t) numerically and plot the result. (Let d be the 
unit of length, d/c the unit of time, and choose a relatively small value of the 
dimensionless constant eK/(me).) 

(g) Solve for y(t) and plot the trajectory in space. (For example, in Mathematica 
solve simultaneously the coupled equations for x(t) and y(t) using NDSolve 
and make the plot with ParametricPlot.) 



CHAPTER 



Electromagnetism and Relativity 


“Everything should be made as simple as possible, but not simpler.” 

Albert Einstein 


Einstein’s theory of special relativity, published in 1905, 1 is the theory of coordi¬ 
nate transformations between inertial frames. It is based on two postulates: 

1. Postulate of relativity. The laws of physics are the same in all inertial 
frames. 

2. Postulate of the absolute speed of light. The speed of light in vacuum is 
the same in all inertial frames. 

An inertial frame means a coordinate system with respect to which Newton’s first 
law of motion—the law of inertia—is true. An inertial frame moves with constant 
velocity with respect to any other inertial frame. A noninertial frame is one that 
accelerates with respect to inertial frames. A reference frame has both a spatial 
coordinate system that locates positions by the distances x,y,z along Cartesian 
axes from an origin, and a time coordinate t that locates events in time. 

Einstein was led to these postulates, especially the second, by the study of 
Maxwell’s theory of the electromagnetic field. Maxwell’s equations imply that 
light is an electromagnetic wave, with wave speed c = 1 / in vacuum. But 

this result raises an obvious question: With respect to what frame of reference 
is the speed equal to ? In classical physics, if an object P moves with 

speed u in one inertial frame, then its speed in another frame is different. For 
example, if the second frame moves with speed v relative to the first frame, in 
the same direction as the velocity of P, then the speed of P in the second frame is 
u — v. However, there is no possibility within Maxwell’s equations for the speed 
of an electromagnetic wave in vacuum to be anything but l/^/po^o- To what 
frame of reference does that speed correspond? The answer is that the speed of 
light is 1/^/p.oco in every inertial frame, and the classical formula for combining 
velocities is wrong. 

Before Einstein, theorists assumed that light propagates in a physical medium 
called the aether. Maxwell, and the field theorists of the next generation, thought 

1 A. Einstein, “On the Electrodynamics of Moving Bodies”, Ann. Phys., 17, 891 (1905). 
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that the field equations would describe the electromagnetic field in the rest frame 
of the aether. Then the calculated wave speed would be the speed relative to the 
fixed aether. In a reference frame moving with respect to the aether, the speed of 
electromagnetic waves would be different than the speed relative to the aether. 

In a famous series of experiments, culminating in a very precise experiment 
in 1887, Michelson and Morley attempted to observe a variation in the speed of 
light due to motion of the Earth through the aether. Using a very accurate inter¬ 
ferometer, they measured the interference between light waves that had traveled 
in orthogonal directions. Varying the orientation of the apparatus would change 
the interference if the aether theory were true, but there was no observed change 
in the interference. The experiment had a null result. This was surely the most 
important null result in the history of physics! There is no detectable variation 
in the speed of light traveling in different directions with respect to the moving 
Earth. This striking result is explained in special relativity by overthrowing a ba¬ 
sic assumption of classical physics—that lengths and time intervals are absolute 
quantities, i.e., the same for all observers. 

The Lorentz transformation is the coordinate transformation between inertial 
frames constructed such that Postulate 2 is obeyed. We assume that the reader has 
some familiarity with this transformation, 2 although we will review it briefly. Our 
main interest in this chapter is to understand how Postulate 1 is satisfied in the 
theory of the electromagnetic field. 


12.1 ■ COORDINATE TRANSFORMATIONS 

12.1.1 ■ The Galilean Transformation 

The first statement of the principle of relativity was by Galileo. He claimed that 
an observer, enclosed below decks in the hold of a large ship, would not be able to 
determine by any experiment whether the ship is at rest or moving with constant 
velocity. This passage from his writings (Ref. 1) is a statement clearly equivalent 
to Postulate 1: 

“Shut yourself up with some friend in the largest room below 
decks of some large ship and there procure gnats, flies, and such 
other small winged creatures. Also get a great tub full of water and 
put within it certain fishes; let also a certain bottle be hung up, which 
drop by drop lets forth its water into another narrow-necked bottle 
placed underneath. Then, the ship lying still, observe how those small 
winged animals fly with like velocity towards all parts of the room; 
how the fishes swim indifferently towards all sides; and how the dis¬ 
tilling drops all fall into the bottle placed underneath.... Having ob¬ 
served all these particulars, though no man doubts that, so long as the 
vessel stands still, they ought to take place in this manner, make the 

2 See, e.g., Refs. 3-6. 
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ship move with what velocity you please, so long as the velocity is 
uniform and not fluctuating this way and that. You shall not be able 
to discern the least alteration in all the forenamed effects, nor can you 
gather by any of them whether the ship moves or stands still.” 

In modem terms we say that the laws of motion are the same in all inertial frames. 

Consider two inertial frames, F and F' , in relative motion. The two frames of 
reference are illustrated in Fig. 12.1. An event that occurs at coordinates x, y, z 
and time t of F , occurs at coordinates x 1 , y', z! and time t' of F'. To be specific, 
suppose frame F' moves with velocity v i, in the x direction, relative to F. For 
example, the origin O' of F' is at (x 1 , y', z!) = (0, 0, 0). The coordinates of O' 
in F are (x , y, z) = (vt, 0,0) so that O' has velocity v i in F. Conversely, the 
origin O of F moves with velocity —v i with respect to IF'. We assume that the 
origins of F and F' coincide at t = 0, which defines what we mean by “time 
zero.” 

The transformation from F to F' is a set of equations that relate the coordi¬ 
nates and time of an event observed in the two frames of reference. The classical 
transformation from Fto F' is called the Galilean transformation. If Amoves 
with velocity v i with respect to T , then the Galilean transformation is 

x' — x — vt, y' = y, z! — z, t' = t. (12.1) 

Note that O' has T' coordinate x' = 0, so by (12.1) it has T coordinate x = vt, 
as specified. 

For an arbitrary velocity vector v of T' relative to T , the Galilean transforma¬ 
tion is 


X||=X|| -Vt, Xj_ = X_|_, t' = t. 


( 12 . 2 ) 


y 

y' 


F 

F' 


" A 


v = 


-► r 


FIGURE 12.1 Two inertial frames of reference. F' moves with velocity v i relative 
to F. 
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where || and _L denote the vector components for directions parallel and perpen¬ 
dicular to v. We emphasize that the relative velocity v must be constant. 

Newton’s equation of motion for a mass m is the same in T or T', provided 
the force is invariant under the transformation. The equation of motion in frame 
J'is md 2 \/dt 2 = F(x). Assume F'(x') = F(x) is the force in the coordinates 
of T'. What is the equation of motion with respect to T' ? We need to write 
the acceleration vector in T' coordinates. First consider the velocity. The particle 
velocity in the frame T is u = dx/dt. The particle velocity in T' is 


u, = 


dx J 
dt' 
dx j 
dt' 


|| d(x || - vt) 


dt 


= Mil 


1 


dx i 
dt 


= uj; 


(12.3) 

(12.4) 


this result is just the classical rule for combining velocities: u' = u — v. 3 The 
Galilean transformation for the acceleration is 


du' d( u — v) 
dt' dt 


because v is constant. Acceleration is a Galilean invariant. Therefore the equation 
of motion in the frame T' is md 2 x'/dt' 2 = F'(x'), the same as in T as claimed. 

We have just seen that Newtonian mechanics satisfies Postulate 1 if the trans¬ 
formation between inertial frames is Galilean and the force is invariant. However, 
Postulate 2 is not satisfied by the Galilean transformation. The coordinate trans¬ 
formation (12.2) implies the classical velocity addition formula, u' = u — v. But 
if u is the velocity of a pulse of light in T , then the speed u' | in T’ would not be 
equal to the speed |u| in T . Einstein’s Postulate 2 implies that (12.2) is not the 
correct transformation between inertial coordinates. 

Equation (12.2) seems so obvious! How can it be wrong? There is an implicit 
assumption hiding in (12.2). These equations assume that lengths and time inter¬ 
vals are absolute quantities—the same for all observers. 4 However, on the con¬ 
trary, length and time are relative. What is absolute is the speed of light. 


12.1.2 ■ The Lorentz Transformation 

The relativistic transformation between inertial frames is called the Lorentz trans¬ 
formation. Again, suppose that an inertial frame T' is moving with constant ve¬ 
locity v i relative to another frame of reference T , as illustrated in Fig. 12.1. The 
Lorentz transformation from coordinates x,y,z and time t of T, to coordinates 
x', y' , z! and t' of T ', is 

x' - y(x — vt), y' — y, z! = z, t' = y(t — vx/c 2 ) (12.5) 

3 We will usually use u to denote particle velocity, and v the relative velocity of different inertial 
frames. If T' is the rest frame of the particle, then u and v are the same. 

4 An “observer” in relativity theory means someone making measurements with respect to his labora¬ 
tory, i.e., his rest frame. 
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where 


1 

y/l ~ V 2 /C 2 


( 12 . 6 ) 


An event that occurs at x, y, z, t in frame T occurs atx', y', z', t' given by (12.5) 
in T'. 

It is a simple calculation to show that (12.5) implies Einstein’s Postulate 2. 
Suppose that a flash of light is emitted from x = 0 at time t = 0. As observed 
in the frame T , there will be a spherical pulse of light traveling away from x — 
0 with speed c. The radius of the expanding light pulse is ct at time t, so the 
coordinates of any point on the pulse satisfy 

x 2 + y 2 +z 2 = c 2 t 2 . (12.7) 


Now consider the same point on the pulse in T' coordinates, given by (12.5). The 
squared distance from the origin O' is 

x' 2 + y a + z' 2 = y 2 (x - vt) 2 + y 2 + z 2 
= y 2 (x — vt) 2 + c 2 t 2 — x 2 
= (y 2 - \)x 2 — 2y 2 xvt + (y 2 v 2 + c 2 )t 2 
= c 2 y 2 (t - vx/c 2 ) 2 . ( 12 . 8 ) 


In the last step we used (12.6) for y. The final result, using (12.5) for is that 

x' 2 + y' 2 + z' 2 = c 2 t' 2 . (12.9) 

That is, the light pulse observed in the inertial frame IF' is also a sphere traveling 
outward from x' = 0 with speed c. Thus Postulate 2 is satisfied if the space 
and time coordinates of different inertial frames are related by (12.5). We could 
reverse this calculation to derive (12.5). If we demand (12.9), and require that the 
transformation be linear and not change the perpendicular coordinates (y and z), 
then the transformation must be ( 12 .5 ). 5 

For an arbitrary relative velocity v the Lorentz transformation is 

x'\\ = y { x \\ ~ vt) , x'j_ = x_l, t'= y (t - vxw/c 2 ^ ( 12 . 10 ) 

where again || and _L denote vector components parallel and perpendicular to v. 
This form of the transformation gives x r and t' in terms of x and t. The inverse 
transformation, which gives x and t in terms of x' and t', is 

x ll = Y (*11 + vt ') > x_L=x'j_, t = y (V + uxf|/c 2 ) . 

5 See Exercise 1. 


( 12 . 11 ) 
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The inverse is equivalent to exchanging (x, t ) and (x', t') and replacing v by — v, 
because .T moves with velocity —v with respect to T' , 6 

Now that we know how to satisfy Postulate 2, by the Lorentz transformation 
(12.10), what remains is to find theories that obey Postulate 1 for this transforma¬ 
tion. The equations of physics must be the same whether written in coordinates 
of T or T'. This property of the equations is called covariance. We shall analyze 
the covariance of electromagnetism in Sec. 12.3. 

12.1.3 ■ Examples Involving the Lorentz Transformation 

Before turning to electromagnetism, it is interesting to review some general con¬ 
sequences of the Lorentz transformation. In the examples below, T and T' denote 
inertial frames, and Amoves with constant velocity v = u i relative to T, as 
in Fig. 12.1. In each example we analyze the spatial and temporal coordinates of 
events observed in the two frames. 


EXAMPLE 1 Time dilation. Consider two events that occur at the same spatial 
point in T ', at times t[ and t' 2 . For example, these events could be two ticks of a 
clock at rest in T ', i.e., moving with velocity v in T. Or, the two events could be 
the creation and decay of an unstable elementary particle at rest in T'. The time 
interval between the events, with respect to T ', is At' = t 2 — t' v What is the time 
interval between the events as measured by an observer at rest in T ? 

The time interval At in T is ?2 —0 ■ To calculate times for T we need to use the 
inverse transformation (12.11), which gives t = y(t' + vx'/c 2 ). The time interval 
between the two events is At = y(At' + vAx'/c 2 ) but Ax' = 0 because the 
events occur at the same location in T'. Therefore, 

/ Ar' 

At = y At = =- — . (12.12) 

yi - v 2 /c 2 

Note that At > At'\ the time interval observed in T is longer than At'. 

According to the observer at rest in T , the time between the events is dilated 
compared to what it would be if the events were to occur at the same location 
in T. The observer in T says “the moving clock runs slow.” If the system is a 
clock that ticks every second (in its rest frame T '), then the ticks are y seconds 
apart in the time coordinate of T. For example, if v/c = 0.99 then y ~ 7. The 
observer in T , using a clock of identical construction, will observe 7 ticks of 
his clock between two ticks of the moving clock. As v approaches c, the dilation 
factor y tends to oo. 


Time dilation is an experimental fact. It is observed in the decay times of un¬ 
stable elementary particles. Unstable particles, created in high-energy collisions, 

6 See Exercise 2. 
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travel mean distance vyx before decaying; here v is the speed of the particle, r 
the mean lifetime in its rest frame, and y = l/^/l — v 2 /c 2 the dilation factor. If 
v approaches the speed of light, then y 1 and the mean distance is much larger 
than cr. 


EXAMPLE 2 Lorentz-FitzGerald contraction. Consider two points fixed on 
the x! axis. For example, these could be the endpoints of a ruler at rest in T ', 
i.e., moving with velocity v i in T. Let Ax' be the separation of the points in T'. 
What is the separation between the simultaneous positions of the two points in the 
frame T ? 

To calculate x values we again need to use the transformation (12.11), specifi¬ 
cally x = y(x' + i it'). Thus the distance between the two points is 

Ax — y Ax' + yv At'. (12.13) 

We are interested in the simultaneous positions of the two points in T , so A? = 0. 
But that means At' is not 0. Simultaneity is relative. By the Lorentz transforma¬ 
tion (12.11), At = 0 implies At' = —vAx'/c 2 . Thus the separation of the points 
in Jms 


Ax = y (l — v 2 /c 2 ^ Ax' = y 1 — v 2 /c 2 Ax'. (12.14) 

Note that Ax < Ax'; the length observed in T is shorter than Ax'. 

For example, suppose a meter stick is laid along the x' axis and is at rest inT'. 
If u/c = 0.99 then the instantaneous length of the meter stick in T is 0.14 m. 

According to the observer at rest in T , the length is contracted compared to 
what it would be if T' were at rest. The observer in T says “the moving ruler is 
short.” This effect was first proposed by FitzGerald, and later independently by 
Lorentz, to explain the null result of the Michelson-Morley experiment. However, 
their idea was that motion through the aether caused a physical contraction of the 
apparatus. Einstein’s theory is different. It was Einstein who first understood that 
the distance between points in space does not have an absolute measure; lengths 
are different for observers in relative motion. 


EXAMPLE 3 Addition of velocities. Suppose a particle P has velocity u' i with 
respect to the frame T '. What is its velocity with respect to the frame T ? Recall 
that T' moves with velocity v i with respect to T. 

In Newtonian physics the answer would be that the velocity of P in T is (u! + 
v) i. We know that is wrong. To determine the velocity of P in T , we must apply 
the Lorentz transformation (12.11). Obviously the velocity is in the x direction, 
of the form u i . Let x p and x' p denote the particle’s x coordinate in T and T' , 
respectively. Then the x component of the velocity of P with respect to T is 
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dx p Y(dx' p + vdt ') 

^ y | dt' + vdx'p/c 

(12.15) 

or, using that dx' p /dt' is n', 


u' + v 

U = -r. 

1 + vu' /c 2 

(12.16) 

Einstein called (12.16) th ztheorem for addition of velocity. If u' and v are small 
compared to c, then u is approximately the classical result u' + v. For example, 
if u' = 0.1c and v = 0.1c then u = 0.198c. But if u' or v approaches the speed 
of light then u also approaches the speed of light. For example, if u' = 0.9c and 
v = 0.9c then u = 0.994c. A massive particle does not move faster than c in any 
inertial frame. If the particle is massless, so that u' = c, then it moves at the speed 
of light in all inertial frames; indeed for u' = c the equation gives u = c for any v. 

More generally, velocity is a vector u. The transformations of velocity compo¬ 
nents can easily be derived by the same method as (12.16), with the results 7 

u\ | + V 

1 + Dt/||/c 2 

(12.17) 

U 1 

U± y(l + vu'u/c 2 )' 

(12.18) 


12.2 ■ MINKOWSKI SPACE 


12.2.1 ■ 4-vectors, Scalars, and Tensors 

The Lorentz transformation is a linear transformation involving four coordinates. 
It is natural to combine the four coordinates in a 4-vector, and regard spacetime 
as a 4-dimensional space, called Minkowski space. We define the 4-vector x' 1 by 


/ X" \ 


' ct > 

x 1 


X 

x 2 

L V 3 i 


y 

i 7 J 


V * 3 / \zj 


(12.19) 


The notation here is important. The index p, and all Greek indices (v, p, a ,...), 
take the values 0, 1,2, 3. We use “suffix notation” for 4-vectors, so x 11 stands for 
the vector itself in some contexts, and for the p component of the vector in other 
contexts. It may be a little confusing at first, but we will regard x ,L as a symbol 
representing a 4-component vector. 

7 See Exercise 9. 
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The Lorentz transformation x M —> x' M is linear, i.e., of the form 


x 


'IX 


= Kx\ 


( 12 . 20 ) 


where A ll v is a matrix of constants. We use the Einstein summation convention 
throughout this chapter. Any repeated Minkowski index, such as v in (12.20), is 
understood to be summed from 0 to 3. So, for example, 

ct' = x' 0 = A° y x u 

= A° 0 x° + AV 1 + A° 2 x 2 + A° 3 x 3 

= A ° 0 ct + A°,x + A° 2 y + A° 3 z. 


The coefficients A fl v of (12.20) may be written as a 4 x 4 matrix, since both // 
and v run from 0 to 3. If T’ moves with velocity v i relative to T then by (12.5) 
the transformation matrix is 


A* 


( Y 

-Py 

0 

V 0 


-Py o o \ 

y 0 0 

0 1 0 

0 0 1 / 


( 12 . 21 ) 


where p = v/c and y = l/^/l — p 2 . If (12.20) is written as a matrix equation, 
and then expanded in its four components, the result is the Lorentz transformation 
equations ( 12 .5 ). 8 


Vectors and Tensors 

Lorentz vectors and tensors are defined with respect to Lorentz transformations, 
in the same way that 3-dimensional vectors and tensors are defined with respect 
to rotations. A 4-vector a 11 is a set of four components that transform in the same 
way as x M under Lorentz transformations, 

a'^ = A %a v . (12.22) 

A tensor T I1V is a set of 16 components that transform in the same way as x ll x v , 

T'txv _ (12.23) 

(Note that p and a are summed !) 9 It is also useful to define a quantity a lL with 
lower index, associated with the arbitrary 4-vector a^, by 


8 See Exercise 5. 

9 Since ix and v can each take four different values, (12.23) is a set of 16 equations; and the right-hand 
side of each equation is a sum of 16 terms. 
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^ ao ^ 


( ~ a ° \ 

a\ 


a x 

<*2 


a 2 

\a 3 ) 


\ « 3 / 


(12.24) 


In this formalism there are two types of 4-vectors: A 4-vector with an upper index 
a fl is called a contravariant vector, and one with a lower index a )L is called a 
covariant vector. 10 The two types are different and must be distinguished by the 
upper or lower placement of the index. Tensors with lower indices are defined 
similarly. For example, 7^,, is defined from T I1V by 

roo^T 00 , Tij = T ij , T 0i = -T 0i , Tio = —T i0 . (12.25) 

Raising or lowering a temporal index (0) changes the sign, but raising or lowering 
a spatial index makes no change. We use roman indices ( i , j, k,...) to denote 
spatial indices. Any roman index takes the values 1, 2, 3, which correspond to 
the x, y, z Cartesian components, respectively. The Lorentz transformation rule 
for a covariant vector is = a v (A -1 ) l j ( , where (A“ 1 ) l ' M is the inverse of the 
matrix A^. 11 


The Lorentz Product 

The Lorentz product of two vectors, which plays the role of the scalar product in 
the Minkowski vector space, is defined by 

a b — = -~a°b° + a • b. (12.26) 

We shall prove the following crucial theorem: a b is a Lorentz scalar, i.e., invariant 
under the Lorentz transformation . 12 The proof is just an algebraic exercise. It is 
sufficient to consider the transformation in ( 12 . 21 ): 

a! -b' = -a ,0 b ,Q + a' l b n + a a b a + a'V 3 

= -y (a 0 - Pa 1 ) y (b° -/3b ] ) + y (a ] - /?a°) y (V - fib 0 ) 

+ a 2 b 2 + a 3 b 3 

= -a°b° + a l b l + a 2 b 2 + a 3 b 3 = a ■ b. 


10 The formalism of contravariant and covariant vectors has largely replaced the older formalism in 
which the fourth coordinate is an imaginary number ict. 

11 See Exercise 6. 

12 The analogue for 3-dimensional vectors is that the dot product a b is invariant under coordinate 
rotations. 
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12.2.2 ■ Kinematics of a Point Particle 

With the mathematical construction of Minkowski space in hand, we are now 
prepared to study the consequences of Postulate 1 for particle motion. First we 
need to introduce some more jargon. 13 The worldline means the trajectory of 
the particle (we’ll call the particle P) in Minkowski space. That is, the worldline 
is a curve in four dimensions—the locus of points in spacetime touched by the 
particle. Also, the proper time of P, which we denote by r, is defined by the 
differential relation 


cdr = yj—dx^dx^ = yj c 2 (dt) 2 — (dx) 2 , (12.27) 

where dx 1 ' is an infinitesimal displacement along the worldline. Note that dr, 
and therefore also r, is a Lorentz scalar. We may also express dr in terms of 
the particle velocity. Let u denote the velocity of P in the inertial frame T , i.e., 
u = dx/dt. Then in terms of the time coordinate of frame T , the proper time 
interval is 


dr =dty/l-u 2 /c 2 . (12.28) 

Note that dr is the time interval experienced by P in its rest frame T p , as P moves 
by dx^ in T . In the rest frame T p the velocity u p is 0, so by (12.28) dr — dt p . 
In words, the proper time is the particle’s own time. 


4-velocity and 4-momentum 
The 4-velocity of P is defined by 


4 



(12.29) 


where again dx 1 ' is along the worldline. The temporal and spatial components of 
4-velocity are 


yj 1 — U 2 /C 2 

u 

yj 1 — U 2 /c 2 


(12.30) 

(12.31) 


using the definition u — dx/dt. An important point is that if is a 4-vector, be¬ 
cause dx^ is a vector and dr a scalar. That is, the transformation of 4-velocity is 
simple: if 11 — Afr] v . In other words, if and rj transform in the same way as ct 
and x under Lorentz transformations. The 3-dimensional vector u has a compli¬ 
cated transformation rule, 14 but the 4-vector if transforms simply. 

13 Jargon is important in science because the words are defined precisely. 

14 See Exercise 9. 
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The 4-momentum of P is defined by 

= mr]» (12.32) 

where m is the rest mass. The rest mass is a Lorentz scalar. 15 The 4-momentum 
is a 4-vector because rf L is a 4-vector and m a scalar. 


Energy and Momentum 

The temporal component of p M is identified as E/c, where E is the total particle 
energy. The spatial components make up the spatial momentum p. That is, 


(p° \ 

P l 


( E/c \ 


Px 

p 2 


Py 

\ p 3 ) 


\Pz / 


(12.33) 


Writing these components in terms of the particle velocity u in an arbitrary inertial 
frame T, using (12.30) and (12.31), the relativistic formulas for particle energy 
and momentum are 


n me 2 

E = mryc = — , (12.34) 

y/1 ~U 2 /c 2 

mu 

p — mr] = . . . (12.35) 

y/l - U 2 /c 2 

By eliminating u we may relate energy and momentum , 16 

E = Jm 2 c 4 + p 2 c 2 . (12.36) 

If p is 0 then the energy is me 2 , the rest energy. If |p| <$C me then E is approxi¬ 
mately me 2 + p 2 12m, the rest energy plus classical kinetic energy; this approx¬ 
imation is a Taylor expansion of (12.36) for small |p|. \i p me then E is 
approximately pc, which is the energy of a massless particle. 

Why are (12.34) and (12.35) the correct definitions of energy and momentum? 
Of course the reason is that they agree with experiment, e.g., collisions of high- 
energy particles. But why, theoretically, are they the correct equations? How did 
Einstein deduce them before there were any high-energy experiments? The an¬ 
swer is that the equations must satisfy Postulate 1 for the laws of conservation 
of energy and momentum. It is a basic law of physics that the total energy and 
momentum are conserved in a collision of particles; indeed, that is the defining 

15 In some discussions of special relativity, a velocity-dependent mass is introduced. In our treatment 
of the theory, m always denotes the rest mass, which is an invariant property of the particle, i.e„ a 
scalar. 

16 Note that E 2 — p 2 c 2 = m 2 c 4 . 
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property of energy and momentum. According to Postulate 1 this law must hold 
in every inertial frame. If E and p make up a 4-vector, as in (12.33), then the lin¬ 
earity of the Lorentz transformation guarantees that if energy and momentum are 
conserved in one inertial frame then they are conserved in every inertial frame. 
For example, consider a 2-body elastic collision. Conservation of energy and mo¬ 
mentum in .Pis that P^ al = P/* tial , where P >1 denotes the total 4-momentum. 
Because the Lorentz transformation is linear, in any other frame T' an observer 
would find 


PZ al = A/ X n al = A^itia, = initial- (12-37) 

That is, energy and momentum are conserved in T'. Equations (12.34) and 
(12.35) are correct because they make p ,L — ( E/c , p) a 4-vector, so that P >1 is 
conserved in all inertial frames. 

Covariance of the conservation laws requires (12.34) and (12.35) for energy 
and momentum. 

12.2.3 ■ Relativistic Dynamics 

Postulate 1 states that the equations of physics m ust have the same form in all iner¬ 
tial frames. This property of the equations is called covariance. There is a method 
for writing equations that guarantees covariance, which is to write the equations 
as relations involving only 4-vectors, scalars, and tensors. That is, covariance is 
automatically true if the equations of a theory are written in Minkowski tensor 
form. The generic equation then has the form 17 

T( 1V = T{ lv (12.38) 

in some reference frame T. In any other frame T' the equation has the same form, 
because by the definition of a tensor, 

r/ MU = A P a A v p Tf = A ^ A v p Tf = T^ V . (12.39) 

Postulate 1 is equivalent to the statement that all the laws of physics can be 
written in Lorentz tensors. We have already seen an example of this fact, in the 
equation for energy and momentum conservation: P^ al = P ialtia] . An equation 
written in tensors is said to be manifestly covariant. 


The Equation of Motion of a Particle in Covariant Form 
Newton’s second law is written in the spatial coordinates and time of some inertial 
frame: dp/dt — F(x, t ), where p is the spatial momentum and F the Newtonian 
force. But this equation is not manifestly covariant. To construct a manifestly 
covariant equation we must write the equation in 4-vectors and tensors. In analogy 

17 Tensors are defined with any number of indices. A rank-two tensor has two indices, as in (12.23), 
or as in the hypothetical generic equation (12.38). A rank-one tensor is a vector. 
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with the Newtonian equation, we write 


d p 11 
dr 


= K'\ 


(12.40) 


where is called the Minkowski force. In (12.40) the proper time is used as the 
independent variable to parametrize the position along the worldline in spacetime. 
The left-hand side is a 4-vector because r is a scalar. Einstein’s Postulate 1— 
covariance—requires that K' 1 must also be a 4-vector, i.e., transform according 
to (12.22). Then (12.40) is the manifestly covariant equation of motion. 

To appreciate the meaning of the Minkowski force, it is useful to relate its 
components K and K° to the Newtonian force F, which is defined as dp/dt. The 
expressions are 


dp dp/dt F 

dr dx/dt /I - w 2 /c 2 ’ 

IdE _ F • u/c 
c dx y 1 — h 2 /c 2 


(12.41) 

(12.42) 


In the second expression we have used the work-energy theorem, dE = F udt. As 
in the case of 3-velocity, the Newtonian force F has a complicated transformation 
rule. But the Minkowski force transforms in a simple way—as a 4-vector. 

Our goal in the next section is to put the equations of electromagnetism into 
manifestly covariant form. 


12.3 ■ ELECTROMAGNETISM IN COVARIANT FORM 


12.3.1 ■ The Lorentz Force and the Field Tensor 


We know that in any particular frame T , the equation of motion of a charge q 
moving under the influence of fields E and B is 


—— = F = qE + qu x B. (12.43) 

dt 

This F is called the Lorentz force. Indeed we used (12.43) to define the fields E 
and B in earlier chapters. But (12.43) is not written in Lorentz vectors and tensors. 
Our goal is to write the equation in tensors. In other words, we shall determine 
the Minkowski force K fl that corresponds to (12.43), such that dp 11 /dx = K fl . 
As a by-product, we shall leam how the fields E and B transform under a change 
of inertial coordinates. 

The components of K' 1 may be written concisely in terms of the 4-velocity 
defined in (12.30) and (12.31). Using (12.41) the spatial componentsof K fl are 


K = 


^E+pxB 

yi- u 2 /c 2 


0 E 

q >1 -E qr] x B; 

c 


(12.44) 
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or, in suffix notation, 


K‘ 


E l 

■ ( i T l 0 — +q*ijkri J B k , 


(12.45) 


where repeated roman indices (here j and k ) are always summed from 1 to 3. (f-,jk 
is the Levi-Civita tensor.) Using (12.42) the temporal component of K 11 is 


K° 


qE ■ u/c E 

.- = = qrj- -. 

y i— u 2 /c 2 c 


(12.46) 


The brilliant insight of Minkowski was to combine E and B in an antisym¬ 
metric Lorentz tensor, which we call the electromagnetic field tensor, denoted 
by F liv (x). The spatial and temporal components of the tensor are defined by 


F' J = e ijk B k , 

(12.47) 

pi 

F i0 = —, 
c 

(12.48) 

8 

II 

o 

(12.49) 


As usual, roman indices ( i , j, k, ...) are spatial, taking values 1, 2, 3 that corre¬ 
spond to Cartesian components x, y, z; for example, E l = E x or = B z . The 
field tensor is antisymmetric: F v/l = — F /JU . Written as a 4 x 4 matrix the field 
tensor is 


/ 

0 

E x /c 

Ey/C 

EJc 

\ 


~E x /c 

0 

B z 

-By 



-Ey/c 

~Bz 

0 

B x 


V 

~E z /c 

By 

~B X 

0 

/ 


(12.50) 


The Minkowski force takes very simple form in terms of F ,LV . We shall verify 
that the Minkowski force on a point charge q is 


=qq v F IJ - v . 


(12.51) 


We must show that (12.45) and (12.46) are correctly given by (12.51). First con¬ 
sider the ith spatial component of (12.51) 

K' =q (q 0 F'° + qjF lJ ) = + qe ijk r] J B k ; 

to justify the second equality note that qo = ~q° and iy = q J . The result is 
(12.45). Next consider the temporal component of (12.51) 

K° = q (qoF 00 + qj F 0j ) = qq^] 
this result is (12.46). Hence (12.51) is proven. 
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TABLE 12.1 Equations of electromagnetism in covariant 
and component form 


Covariant form 

Component form 

d x 

| dp/dt = q(E + u x B) 

| dE/dt = qu ■ E 

^ = o 

dxV 

dp 

VI — - — 
dt 

dF^ 

- = U() J 1 

dx v 0 

( V • E = p/(Q 

| VxB-(l/c 2 )9E/a; = ri 0 J 

dG^ v 
dx v ~° 

f V- B = 0 

{ V x E + dB/dt =0 

dA v dAft 

| E = —VV — dA/dt 
\ B = V x A 


The 4-velocity rj 1 ' and Minkowski force K' 1 are Lorentz vectors. Therefore 
(12.51) implies that F I1V must be a Lorentz tensor. That is, it must transform under 
a change of coordinates in the manner of a tensor (12.23). Just as the Lorentz 
product of two vectors a v b v is a scalar, the similar product of a vector and a 
tensor a v T^ v is a 4-vector. 18 The right-hand side of (12.51) is a 4-vector if and 
only if F ,lv is a tensor. From this requirement we will deduce the transformations 
of E and B, in Sec. 12.4. 

In conclusion, the equation of motion of a particle of charge q is, in covariant 
form. 


-^-=qr, v F» v . (12.52) 

ax 

Table 12.1 lists the equations of electromagnetism in both covariant and com¬ 
ponent form. The first entry in the table is the equation of motion of a charged 
particle. 


12.3.2 ■ Maxwell's Equations in Covariant Form 

By writing the equation of motion of a point charge in covariant form, we have 
identified the electromagnetic field as an antisymmetric tensor F 1 ' v . In accord 
with Einstein’s Postulate 1, it must also be possible to write the Maxwell field 
equations in tensor form. 

As a first step, we consider the charge and current. The densities p(x, t) and 
J(x, t) are not covariant, but they combine to make a 4-vector current J IJ (x), 

18 In general, the product of a vector and a tensor of rank n is a tensor of rank n — 1. 
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defined by 


( C P \ 
J x 

Jy 

V / 


(12.53) 


The continuity equation, which is an essential part of electrodynamics, is, in co¬ 
variant form, 

dJ tl 

— = 0. <12.54, 

A subtle point here is that d/dx 1 ' transforms as a covariant vector, not a con- 
travariant vector. 19 Therefore 3 J >1 /Bx fl is, like the Lorentz product of vectors, a 
scalar. Local conservation of charge requires that this scalar is 0. It is a simple ex¬ 
ercise to verify that in component form (12.54) is the familiar continuity equation, 
V • J = -3p/3f. 

Two of the Maxwell equations relate fields and sources—Gauss’s law and the 
Ampere-Maxwell law. Gauss’s law is a scalar equation, i.e., a scalar with respect 
to spatial rotations. The Ampere-Maxwell law is a vector equation with respect to 
rotations. The two equations together make one covariant 4-vector equation, i.e., 
covariant with respect to Lorentz transformations, which is 

3F ilv 

——=1X0^. (12.55) 


The temporal component (/z = 0) of (12.55) is Gauss’s law. Note that dF 0v /dx v 
is V - E/c; the term in the sum over v with v = 0 is 0 because F 00 = 0. For /z = 0 
the right-hand side of (12.55) is p/(eoc), because t l o c is equal to l/(eoc). Hence 
the temporal component of (12.55) is equivalent to V • E = p/eo. 

The spatial components of (12.55) are the Ampere-Maxwell law. Note that 
dF iv /dx v is a sum of four terms. The term with v = 0 is — (\/c 2 )dE‘/dt, and the 
sum over v = 1, 2, 3 is the i component of V x B. Hence the spatial components 
of (12.55) are equivalent to V x B — (l/c 2 )3E/3r = /zoJ. Table 12.1 lists side- 
by-side the field equations in covariant form and component form. The first two 
Maxwell equations appear as the third row in the table. 

The other two Maxwell equations involve only the fields, not charge. With re¬ 
spect to spatial rotations, these equations are a scalar equation (Gauss’s law for B) 
and a vector equation (Faraday’s law). With respect to Lorentz transformations, 
the two equations together make one covariant vector equation, which may be 
written as 


19 


3G ,tv 

dx v 


= 0 . 


(12.56) 


See Exercise 14. 
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G' lv is another antisymmetric tensor, called the dual field tensor. It is defined in 
covariant form by the relation 

G uv = l e ^P Fapt (12.57) 

where e^vap, j s t [ le completely antisymmetric tensor in four dimensions. This ten¬ 
sor is the generalization to four dimensions of the Levi-Civita tensor in three 
dimensions: The component is 0 unless all four indices are different; and 
e liVCl P is +1 or —1, respectively, if /xvafi is an even or odd permutation of 0123. 
So, for example, € 0ijk = Gjk . Equation (12.57) is equivalent to these spacetime 
components of G^ v : 


G U = - -Gjk^, 

(12.58) 

•G'° = y° ijk Fj k - B‘, 

(12.59) 

O 

o 

o 

II 

p 

(12.60) 


It is an interesting exercise to verify (12.58) and (12.59). Note that in (12.57) the 
nonzero contributions for /iv — ij have afi either Ok or kO, because e flva ^ is 0 
if any two indices are equal; hence (12.58). Also, the nonzero contributions for 
fi = 0 in (12.57) have va.fi all spatial ijk; hence (12.59). In matrix form 


/ 

G^v _ 

\ 


0 

~B X 
-By 

-B z 


B x 

0 

E z /c 

— Ey/C 


By B z \ 

-Ez/c E v /c 

0 -E x /c 

E x /c 0 / 


(12.61) 


The dual tensor has the same form as F ,Av , but with the replacements E/c —> B 
and B —► —E/c. 

As an exercise, it it straightforward to verify that the temporal component (t L = 
0) of (12.56) is Gauss’s law (V • B = 0); and the spatial components (/t — 1, 2, 3) 
make up Faraday’s law (V x E + 3B/3r = O). 20 These results are the fourth row 
in Table 12.1. 

Maxwell’s equations describe many physical phenomena. The fact that these 
equations reduce to the very compact form 


dF flv /dx v = jJL 0 J 11 and dG^/dx v = 0, 


indicates there is something really deep here! 


12.3.3 ■ The 4-vector Potential 

In earlier chapters we have often noted the importance of the potentials V (x, t) 
and A(x, t). In the covariant form of electromagnetism, these potentials combine 

20 See Exercise 15. 
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to make a 4-vector potential A tJ (x), defined by 


/ V/c 



\ 


(12.62) 


The field equation dG llv j'dx v = 0 is automatically satisfied if F /lv is written as 


p/J-v _ 


dA v 

9x m 


dA» 
dx v ' 


(12.63) 


Note also that this formula automatically makes F /IV antisymmetric. The compo¬ 
nents of the tensor equation (12.63) are given as the final row of Table 12.1. They 
are the familiar relations between the potentials V and A and the fields E and B. 


12.4 ■ FIELD TRANSFORMATIONS 

The electromagnetic field is a Lorentz tensor F /lv , so the transformation of the 
field from one inertial frame T to another T' is prescribed by the general defini¬ 
tion (12.23) of tensor transformations, 

F'» v (x') = A» p A v a F pa (x) (12.64) 

where = A ll . x x . In this section we will split this field transformation into the 
separate parts for E' and B'. 

First, to be definite, assume that the velocity of T' relative to T is u i, in the x 
direction. Then the transformation matrix is (12.21). The electric field that would 
be measured by an observer at rest in T' is 

E n = cF m = cA° p A' a F pa . (12.65) 

The right-hand side is a sum of 16 terms, because p and a run from 0 to 3. 
However, most of the terms are 0 because of the form (12.21) of A*\. For i = 1 
the only nonzero terms have pa = 01 and 10; thus, since E l = E x , 

E' x = c Y 2 F m + c(-Py) 2 F 10 = cF 01 = E x . (12.66) 

For i = 2 the nonzero terms have pa = 02 and 12, so 

E' y = cyF 02 - cpyF n = y {E y - vB z ); (12.67) 

and similarly, for i = 3, 

E' z = cyF m - cflyF 13 = y(E z + vB y ). 


( 12 . 68 ) 
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TABLE 12.2 Field transformations between 
frames T and T ', where T' moves with velocity n i 
with respect to T 


E' x = E x 

B' x = B x 

E y = Y ( E y - vB z ) 

B 'y = Y (By + vE z /c 2 ) 

E' z = y ( E z +vB y ) 

B 'z =Y (b z - vE y /c 2 ) 

Ex = E' x 

B x = K 

E y = Y (E' y + vB' z ) 

B y = y(B' y - vE'Jc 2 ) 

E z = y ( f ' - vB' y ) 

B z =y (B' ? + vE'y/c 1 ) 


Table 12.2 lists the field transformation equations for this case in which the ve¬ 
locity of T' with respect to T is u i. 

The magnetic field in T' is 

B" = F' jk = A j p A k a F pa , (12.69) 

where ijk is a cyclic pennutation of 123. For i = 1, jk is 23; then the only 
nonzero term in the sum over p and a has pa — 23; that is, 

B' x = F 23 = B x . (12.70) 

For i = 2, jk is 31; then the nonzero terms have pa = 30 and 31, so 

By = -£yF 30 + yF 31 = y (s y + vE z /c 2 ) ; (12.71) 

and similarly, for i = 3, jk = 12, and so 

B' z = -pyF 02 + yF 12 = y (b z - vE y /c 2 j . (12.72) 

These magnetic field transformations are also listed in Table 12.2. 

For an arbitrary relative velocity v the field transformations are, by generaliz¬ 
ing the previous special case, 

£[l = £|| and E'j_ = y (Ej_ + v x B_l) , (12.73) 

B[ = B\\ and B' ± = y(Bi-vx E ± /c 2 ^ . (12.74) 

As before || and _L denote vector components parallel and perpendicular to v. 
The inverse transformation equations are obtained from (12.73) and (12.74) by 
making the replacements E -o- E', B -o- B', and v -o- — v. Given fields E and 
B in the inertial frame T , however created, the fields that would be measured by 
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TABLE 12.3 Lorentz transformations of various quantities. The 
inertial frame T' moves with velocity v with respect to frame T. 
The components denoted || and _L are parallel and perpendicular 
to v. 


Coordinates 

t' = y{t - wq|/c 2 ) 

t = y(t' + ux[|/c 2 ) 

II 

X 

1 

e 

*11 = y(x[| + vt') 

X ± = X X 

x_L = *x 

Energy and momentum 

E' = y(E - up||) 

E = y(E r + upj|) 

P'\\ = Y(P\\ ~ vE /c 2 ) 

P\\ = Y(P\\ + vE'/c 2 ) 

PX = P-L 

Px = p'x 


Velocity 

K|| = (M|| - «)/(l - VU\\/C 2 ) U\\ = («j| + !>)/(l + V U \/C 2 ) 

»X = (1/K)UJ_/(1 - vu\\/c 2 ) uj_ = (l/y)u' ± /(l + vu'^c 2 ) 


Electric and magnetic fields 


E [ = E \\ 

JTl 

II 

E'j_ = x(Ex + v x Bx) 

Ex = y(E'j_ - v x B'j_) 

II 

_Co 

03 

II 

B'x = y(Bx -V xEx/c 2 ) 

Bx = y(B'j_ + V X E'j_/ c 2 ) 


an observer at rest in T' are (12.73) and (12.74), where v is the relative velocity 
of T '. Or, given E' and B', the inverse transformation determines E and B. For 
archive purposes we record the general transformation for the fields in Table 12.3, 
along with the transformation rules for some other quantities. 

The field transformations mix electric and magnetic fields. For example, sup¬ 
pose there is a set of charges that are at rest in T '. Then the magnetic field B' 
is 0. An observer in T' experiences only an electric field. A bar magnet at rest 
in T' experiences no torque. However, an observer in T experiences both elec¬ 
tric and magnetic fields due to the same set of charges. A bar magnet at rest in 
T experiences a torque. This is not surprising: The charges move with velocity v 
in T , and their current creates a magnetic field. One important point is that if we 
want to calculate B(x, t) and E(x, t) for the inertial frame T, it is not necessary 
to solve Maxwell’s equations in the coordinates of T . Instead, we only need to 
calculate the Lorentz transformation of E'(x'), listed in Table 12.3. We shall use 
this method to determine the fields of a moving charge in the next section. 

In special relativity, t and x combine into a 4-vector x M . Also, E and p com¬ 
bine into the 4-momentum p ,x . Quantities that were considered to be separate in 
classical physics, like time and space, or energy and momentum, are unified by 
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the theory of special relativity. A change of inertial frame produces mixing of 
these quantities. Similarly, the fields E and B combine into the field tensor F llv . 
The tensor structure specifies how the fields in one frame of reference are related 
to those in another frame. 


EXAMPLE 4 Suppose in an inertial frame T there is a uniform electric field E = 
£ok, but no magnetic field, B = 0. What are the fields in a frame T' that moves 
with velocity v = v i with respect to T ? 

A physical realization of this example would be the space between parallel 
charged plates with surface charge densities ±ao. See Figure 12.2; the lower plate 
is positive and the upper plate negative. In the rest frame of the plates (T ) the 
electric field is (oo/eo)k by Gauss’s law. Now imagine a tiny observer moving 
between the plates with constant velocity v i and making field measurements in 
her laboratory (T '). 

We’ll use Table 12.2 to determine E' and B'. The only nonzero field component 
in frame Tis E z = £o- By Table 12.2 

E' x = 0, E' y = 0, and E' z = yE 0 ; (12.75) 

and 

B' x =0, B' y = ^-jEo, and B[ = 0. (12.76) 

In T' there is a uniform electric field y £<)k and a uniform magnetic field 
yvEo/c 2 }. 

The same results can also be derived by identifying the sources in T' . In 
the primed coordinate system there are two charged plates with surface densi¬ 
ties icr' = ±y<To. The factor y comes from the Lorentz-FitzGerald contraction 
of the plate in the x direction, which increases the density compared to T . The 
electric field E' due to the surface charge is (<r'/f o)k, which is y£ok. Also, in 
T' there are two current sheets because the charged plates move (in T ') with ve- 


z 

t 

i 

i 


2'+ v 



+ + + + + + + 


FIGURE 12.2 Example 4. T' moves with velocity v i relative to T , which is the rest 
frame of a pair of charged parallel plates. 
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locity — v i; the surface current densities in T' are KJ = —o'v i for the lower 
plate and K' u = +a'v i for the upper plate. The magnetic field between the plates 
is then B' = noa'v j by Ampere’s law, which is ( yvE^/c 2 ) j. 

Note that the fields observed in T' satisfy the relation 

B' = -vx E'/c 2 (for B = 0). (12.77) 

This is a special case of a general theorem: If B = 0 in some inertial frame T, then 
(12.77) holds in any other inertial frame T '; v is the constant velocity of T' with 
respect to T 21 


EXAMPLE 5 What does an electromagnetic wave look like in a moving frame of 
reference? More precisely, consider a plane wave traveling in the x direction and 
polarized in the z direction 22 

E(x, 

B(x, 

What are the fields observed 
spect to T ? 

We can derive E'(x', f') and B'(x', t') using Table 12.2. Let’s start by writing 
kx — cot in primed coordinates, since that variable contains the spacetime depen¬ 
dence. By the Lorentz transformation, 

kx — cot = ky(x' + vt') — coy(t' + vx'/c 2 ) 

= k'x' - co't' 

where we introduce new wave parameters k' and co for the primed frame, 

k! = y [k-vco/c 1 ^ (12.79) 

co' = y (co — vk). (12.80) 

(This transformation is equivalentto the statement that ( co/c , k) forms a 4-vector.) 
For an electromagnetic wave in vacuum, co = ck. Therefore the frequency ob¬ 
served by an observer at rest in T' is 


t) = E ocos(kx — wt)k, 
Eo 


t) =- cos(kx — cot) j . 

c 


(12.78) 

in a frame T' that moves with velocity v i with re¬ 


ft/ = y (1 — v/c) CO 


1 — v/c 

- -—ft) 

1 + v/c 


(12.81) 


21 See Exercise 17. 

22 In (12.78) we have used the property of a plane wave that B = E/c. 
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This result is the Doppler shift for light. If v > 0 then the wave observed in T' has 
a lower frequency, i.e., a red shift, compared to T ; if v < 0 there is a blue shift. 
Note that the wave parameters for T' also obey to' — ck', which implies that the 
wave speed is the same in T' as in T. 

Now we refer to Table 12.2 to find the field components observed in T '. The 
nonzero components in .Fare E z and B y . According to the transformations in 
Table 12.2, the nonzero components in T' are £' and B' y . For the electric field we 
find 


E’. = y (Eo — vEo/c) cos (kx — cot) 

= E' 0 cos(k'x' - co't') (12.82) 


where the amplitude in T' is 


E' 0 = y(l — v/c)Eo = yY^T~f~Eo- (12.83) 

For the magnetic field we find 

By = Y Eq/c + vEo/c 2 ^) cos (kx — cot) 

E’ 

=-- cosOt V - co't'y, (12.84) 

c 

again we have the relation B' = E'/c. The fields E'(x', t') and B'(x', t') form 
a transverse electromagnetic wave with the usual properties required by the 
Maxwell equations. The field transformations of an electromagnetic wave yield an 
electromagnetic wave. This is consistent with—indeed demanded by—Einstein’s 
Postulate 1, covariance. 

So, to answer the original question, the fields observed in T' form an electro¬ 
magnetic wave. However, there are some differences compared to the wave in T. 
In addition to the Doppler shift of frequency (or wavelength) there is a change in 
the amplitude of the field oscillations, given by (12.83). If v > 0, i.e., the observer 
moves in the direction of the wave, there is a red shift and a decrease in amplitude. 
In the limit v —>■ c the wavelength tends to oo and the amplitude tends to 0. If 
v < 0, i.e., the observer moves in the opposite direction, there is a blue shift and 
an increase in amplitude. 


12.5 ■ FIELDS DUE TO A POINT CHARGE IN UNIFORM MOTION 

This section describes an example of the use of special relativity—the Lorentz 
transformation of coordinates and fields—to calculate fields. The example is very 
simple and basic: Consider a charge q moving with constant velocity with respect 
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to an inertial frame T. What are the electric and magnetic fields? The reader 
should appreciate that this very basic question has not been addressed in any pre¬ 
vious chapter, except for the special case of a charge at rest. 

To be specific, let the velocity of q be v i in T , along the x axis. Then the 
position of q as a function of time is x q (!) = vt i. We will derive E(x, t) and 
B(x, t) by Lorentz transformation from the rest frame of q, which we call T'. 
The charge q is fixed at the origin of T '. In T ', the magnetic field B' is 0, and the 
electric field E' is simply the static Coulomb field 


E'(x') 


_q_ _ 

4ne 0 |x'| 3 ’ 


(12.85) 


where x' = x' i 4- y' j + z'k is the field point in T' coordinates. The fields E(x, t) 
and B(x, t) observed in T are to be calculated from the inverse transformation 
(the lower rows) in Table 12.2. Note that Amoves with respect to T with the 
same velocity v = v i as q. 

The field point has coordinates x,y,z and time t in frame T , and coordinates 
x' ,y', z' and time t' in T' . These coordinates and times are related by the Lorentz 
transformation (12.5). Therefore the vector x' in T' , from q to the field point, but 
reexpressed in the coordinates of T , is 

x' = y(x - i>r)i + yj + zk, (12.86) 

where y = l/^l — v 2 /c 2 as usual. The distance |x'| observed in T' is 

l x, | = \/ y 2 (x - vt) 2 + y 2 + Z 2 . (12.87) 

The electric field components in T are E x = E' x , E y = y £', and E z = yE' v 
according to the field transformations in Table 12.2. Applying these transforma¬ 
tions to the Coulomb field (12.85), but reexpressed in coordinates of T, we obtain 
the electric field observed in T , 


E(x, t) = 


qy (x - upi +yj + zk 
471-60 [ y 2( JC _^ ) 2 + ) ,2 + z 2]3/2- 


( 12 . 88 ) 


Recall that in T the position of q at time t is vt i. Therefore ( x — vt) i +y j +zk is 
the vector in T from q to the field point x. Equation (12.88) says that the direction 
of E(x, t) points radially away from the simultaneous position of q. 

If v = 0 then (12.88) is just the static Coulomb field, which is spherically 
symmetric, proportional to r/r 2 , and centered at the origin. If v is small, i.e., v <$C 
c, then to first order in v/c we may approximate y by 1. In this approximation 
E(x, t) is the Coulomb field centered at the moving position vt i of q. This limit 
is an example of the quasistatic approximation: The field has the same form as in 
electrostatics, but is centered at the slowly moving position of q. 
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FIGURE 12.3 Electric field lines of a moving charge q. The speed of q in the two 
cases is (a) 0 and (b) 0.9 c. The density of lines is proportional to the field strength. 


If v is large, i.e., approaching c, then the electric field is quite different from 
the Coulomb field. First, because of Lorentz-FitzGerald contraction in the x di¬ 
rection, the field lines are compressed into a disk normal to the direction of mo¬ 
tion. An observer at rest at ( xo , yo, zo) in T would experience a sharp pulse of 
electric field as q moves past, with maximum strength when t = xq/v. Second, 
the magnitude of the field is increased by the factor y. The electric field lines for 
charged particles with speeds v = 0 and v = 0.9 c are compared in Fig. 12.3. 
Another way to visualize the electric field of a moving charge is shown in Fig. 
12.4, which shows the electric field vectors of q for v = 0 and v = 0.9, at points 
equidistant from the charge. 

The magnetic field components in J 7 are B x = 0, B y = —yvE'Jc 2 , and B z = 
yvE'y/c 2 , according to the field transformations in Table 12.2. Therefore we find 



FIGURE 12.4 Electric field vectors of a moving charge q. The speed of q in the two 

cases is (a) 0 and (b) 0.9 c. The field vectors are plotted, for the same scale, at points 
equidistant from the moving charge. 
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that the magnetic field observed in T is 


B(x, t) 


qyv 


(-jz + ky) 


47 T 60 C 2 [y 2 (.x — vt ) 2 + y 2 + Z 2 ] 


3/2 ' 


(12.89) 


The direction of B(x, t) is tangent to a circle around the x axis, i.e., curling around 
the path of q in the sense given by the right-hand rule 23 The ratio of magnetic 
and electric field strengths is 


|B| = v I y 2 + z 2 \ 1/2 
|E| c 2 y (x - vt) 2 + y 2 + z 2 J 


(12.90) 


If v is small compared to c, then |B| <£ |E|/c. 

If v is large, approaching c, then the magnetic field is compressed into a sharp 
pulse coinciding with the electric field pulse. At the center of the pulse ( x = vt) 
the ratio B/E is v/c 2 , which is approximately 1/c if v approaches c. Thus the 
electric and magnetic pulse due to a relativistic charged particle resembles a pulse 
of light: E and B are perpendicular, with B « E/c. 

To summarize the results we have obtained for E(x, t) and B(x, t) due to a 
charge moving with constant velocity, Fig. 12.5 represents a snapshot of the fields 
at an instant. In the space around a positive charge, E points away from the current 
position and B curls around the path of the particle. 



FIGURE 12.5 Fields of a moving charge. These are the fields at an instant. E points 
away from the position of q, and B curls around the trajectory. 

23 Note, too, that B = v x E/c 2 , consistent with the general theorem in Exercise 17. 
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EXAMPLE 6 Suppose a point charge moves with velocity v = 0.9c i along the x 
axis. For a point P on the z axis, at distance d from the origin, we’ll determine and 
plot the nonzero components of E and B as functions of time. The source point is 
at ( vt , 0, 0) and the field point P is at (0, 0, d) as shown in Fig. 12.6(a). 

Using (12.88) the nonzero components of the electric field at P are 


q -yfis 

Aneod 2 [ y 2p2 s 2 + j] 3 / 2 

(12.91) 

q y 

Aneod 2 [- y 2p2 s i + j] 3 / 2 

(12.92) 




FIGURE 12.6 A moving charge, (a) The charge q moves along the x axis, and the field 
point P is on the z axis at distance d from the origin, (b) The nonzero field components, 
plotted as functions of the dimensionless variable s = ct/d. (s is the time in units of d/c.) 
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where we have introduced a dimensionless variable s = ct/d proportional to the 
time. Using (12.89) the nonzero component of the magnetic field at P is 

q —yB 

By = - H -—— ttt • (12.93) 

• 4ne 0 cd 2 [ y 2 p 2 s 2 + i] 3 / 2 

These three functions are plotted in Fig. 12.6(b). In the graph the unit of electric 
field is q/(47teod 2 ). 


EXAMPLE 7 Fields of a moving line charge. As another example of the use 
of field transformations, consider a long line of charge at rest with respect to the 
reference frame T ', and lying along the x' axis. Let X' be the linear charge density 
in T'. To determine the field in T' is a problem in electrostatics. By Gauss’s law, 


E'(x') 


A'R 

2neoR' ’ 


(12.94) 


where R' is the perpendicular vector from the line to x'. (We treat the charged line 
as infinite.) The magnetic field B' is 0. 

Now let T be another inertial frame, relative to which the charged line (and T ') 
moves with velocity v i. What are the electric and magnetic fields that would 
be observed in T2 According to the field transformation (12.73), or rather its 
inverse, the electric field observed in .Fhas components £j| = E J =0 and Ex = 
y E^_. This field E should be expressed in terms of coordinates of T . Because R' 
is perpendicular to v, and the perpendicular coordinates do not change under a 
Lorentz transformation, so the perpendicular vector from the line to x is R = R'. 
Thus the electric field is 


E(x) = ---, (12.95) 

2neoR 

where A. = yX'. The direction of E is perpendicular to the moving line charge. 
The charge density in T is X = yX', increased by the factor y compared to X' 
because of Lorentz-FitzGerald contraction of the line. The electric field E is just 
the electrostatic field of a charged line with density X. 

But there is also a magnetic field in T . By the field transformation (12.74) it is 


B = yv x Ej_/c 2 = 


ixoXv x R^ 


27 tR 


(12.96) 


This field is just the magnetostatic field due to a steady current I = Xv i. (Note 
that i x R is a unit vector ip curling around the x axis.) Indeed, a charged line 
moving with constant velocity is a steady current I. By the field transformations, 
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we have merely reproduced the familiar static fields for a line charge and a line 
current. 


As an interesting extension of Example 7, consider two charged lines, that 
have equal but opposite densities ±Ao in their rest frames. Let the lines be par¬ 
allel to the x axis, but slightly separated so that they can move independently. 
As a first instance, suppose the lines move in the lab frame with equal but op¬ 
posite velocities ±iu/2, respectively. Because they have opposite charges, the 
oppositely moving lines are both equivalent to currents in the + i direction. Then 

* A 

the magnetic field is poIfr/(27tR), where is azimuthal around the x axis, and 
I = Xqv/ y/\ — u 2 /4c 2 . The electric field is 0 because the positive and nega¬ 
tive charges cancel. But as a second instance, suppose the positive line is at rest 
in the lab and the negative line moves with velocity — i v. In this case the mag¬ 
netic field is /xo/'0/(2 jt R), where I = "h oi >/— v 2 /c 2 . The electric field is 
(A+ — A._)R/(27reo/?), where A + —A_ is the net charge density, which is Ao(l — y). 
The negative line is Lorentz contracted, compared to the positive line, so there is 
a net negative charge and its corresponding electrostatic field. 

A current in a metal wire is analogous to the second case in the previous para¬ 
graph, because the electrons move opposite to the current while the positive ions 
are at rest. Therefore a current carrying wire in its rest frame has a tiny electric 
field. The field is immeasurably small because the relevant electron velocity is the 
drift velocity , which is typically smaller than c by 12 orders of magnitude. 


12.6 ■ MAGNETISM FROM RELATIVITY 

In Chapters 3-11 we studied the field theory of electromagnetism one piece at 
a time, by considering particular electric and magnetic phenomena. We started 
with electrostatics, went on to steady currents and magnetostatics, then time- 
dependent fields, and finally electromagnetic waves. The theory that emerges from 
describing these phenomena mathematically—Maxwell’s field theory—is consis¬ 
tent with the postulates of special relativity. Indeed it was the study of Maxwell’s 
theory that led Einstein to special relativity. The presentation of electromagnetism 
in Chapters 3-11 resembles the historical development of the subject. We might 
call this presentation the phenomenological approach , because it emphasizes the 
physical phenomena. This approach is inductive. 

We could, however, adopt a different presentation, which might be called the 
axiomatic approach. This approach is deductive, and more theoretical. The start¬ 
ing point would be to write down the minimal set of axioms from which the 
full structure of electromagnetic field theory can be deduced. The sufficient ax¬ 
ioms are Coulomb’s law, i.e., that the electric field of a point charge at rest is 
E = <7r/(47reof 2 ); and the principles of special relativity. These lead inexorably 
to Maxwell’s equations. In particular, magnetism can be derived as a necessary 
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consequence of special relativity. The purpose of this section is to see how mag¬ 
netism follows from relativity. 

We’ll start with an electric field. Consider a frame of reference T' in which 
there is a static electric field E', created by charges at rest. To an observer in T ', 
a test charge q moves under the influence of the electric force F', according to the 
equation of motion 




(12.97) 


To an observer in another frame T , relative to which T' moves with velocity v, 
the equation of motion is dp/dt = F. What is this force F? We may determine F 
from the Lorentz transformation. Refer to Table 12.3. The component of F parallel 
to v is 


dPl _ Y + VdE ' /c2 ) 

y [dt' + vdx'^/c 2 ^ 


(12.98) 


the first equality being the definition of F\\, and the second following from the 
Lorentz transformation of 4-momentum and 4-position. According to the work- 
energy relation, dE' = F' • u 'dt', where u' = dx!/dt' is the velocity of q relative 
to frame T '; thus, 


F,| = 


n +v )ic 2 r . 

- —- 2 -= F, + ymu • F Jc . 


(12.99) 


In the second equality we have used the velocity transformation (see Table 12.3) 
to rewrite the expression in terms of the velocity u of q in T. The final result is 

F l =qE\ l + ^u ± -E[ L . ( 12 . 100 ) 

By a similar calculation, the components of F perpendicular to v are 


F.l = 


dp±, 

dt 


<?E' X 

Y (l + ™<j|/c 2 ) 


( 12 . 101 ) 


Special relativity requires that the force on q, observed in frame T, is given 
by (12.100) and (12.101). It is not obvious yet, but we shall show that this force 
is in fact the Lorentz force 


F = qE + <?u x B, (12.102) 

where E(x, t) and B(x, t) are the fields in the frame T. Our task is to determine 
the fields such that (12.102) is the same as (12.100) and (12.101). If we succeed 
then we have shown that magnetism—the force qu x B—is a necessary conse¬ 
quence of relativity. 
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To proceed deductively is quite tedious. Instead, we shall state the result and 
prove that it is correct. The fields E and B are given by 

E\\ = and E_l = yE' ± (12.103) 

By = 0 and Bjl = ^-v x E' ± . (12.104) 

We need to prove that the Lorentz force with these fields is (12.100) and (12.101). 

The parallel component. Consider v • F = v fiy. For the Lorentz force, 

ufi|| = qvE\\ + q\ ■ (u x B). (12.105) 

Now, v • (u x B) is equal to u • (B x v) by a vector identity. In the expression we 
may replace B by Bx because the cross product of the parallel component of B 
with v is 0. Similarly we may replace u by ux because the parallel component 
would contribute nothing. Then inserting (12.104) for B_ 

vFy = qvE\\ + qu±_ ■ ~ [(v x E' ± ) x v] . (12.106) 

The quantity in square brackets is E' ± u 2 since by definition E' ± is perpendicular 

to v. Thus 

uE|| = v \qE[ +q\yu± • E' ± ] (12.107) 

and this is the same as (12.100). 

The peipendicular components. This time consider v x F, which is the same as 
vxFi. For the Lorentz force, 

v x Fj_ = qy x Ex + qy x (u x B). (12.108) 

Now, the double cross product is uvfi|| — Bvk||; or, — BuMy because B\\ is 0. 
Inserting (12.103) and (12.104) we have 

v x F_|_ = q\ x (yE' ± ) — qvu\\^y x E' ± = q\ x E ' ± y j^l — . (12.109) 

But by the addition of velocities the term in square brackets is 

y~ 2 (l + vu\/c 2 ^ . 

The final result is 

qE'± 

y (i + ^«y/c 2 ) 


v x Fx = v x 


( 12 . 110 ) 
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which is the same as (12.101). We have proven that the Lorentz force (12.102) is 
just what is required by relativity. 

So, there must be a magnetic force in T if there is an electric force in T' . Of 
course the fields (12.103) and (12.104) are precisely what we would obtain by 
applying the Lorentz transformation of fields, listed in Table 12.3, to compute E 
and B from E'. It follows that E and B form an antisymmetric tensor F^ v . Then 
for covariance the field equations must be Maxwell’s equations. 

In this sense magnetism is a relativistic effect: The Lorentz transformation 
implies the existence of a velocity-dependent force on q, perpendicular to u, in 
any reference frame where the charge sources are in motion. Furthermore, the 
structure of the field equations, including those of magnetism, is completely de¬ 
termined by the requirement of relativistic covariance. Although axiomitization is 
merely formalism, it is interesting that relativity demands certain interactions of 
particles and fields. 


12.7 ■ THE ENERGY-MOMENTUM FLUX TENSOR 

We will study one final theoretical development, relating to energy and momen¬ 
tum of the electromagnetic field. The covariant form of the electromagnetic theory 
is ideal for analyzing energy and momentum, because these quantities are com¬ 
bined by special relativity. The derivations in this section are difficult, but the 
results are rewarding. 24 They reveal the equations for field momentum. 

The equation of motion for a single point charge is (12.52). We may generalize 
this equation, to describe any charge distribution in a volume. Let (x) be the 4- 
current (cp, J). (x denotes the 4 spacetime coordinates.) The Minkowski force on 
the charge in a volume d^x is obtained from (12.52) by replacing qi] v by J v d^x. 
Therefore, if V fl (x) denotes the momentum density (momentum per unit volume) 
of the charged matter, then the equation of motion is 

dV M 

-= J V F^ V . ( 12 . 111 ) 

3r 

In words, J v F tlv is the rate of change of 4-momentum, per unit volume, of the 
charged matter, with respect to proper time. 

Now, local conservation of energy and momentum implies that 3'P /i /3 1 can be 
written as a divergence in Minkowski spacetime, i.e., 

dT^ v 

dyF^ = ——( 12 . 112 ) 
dx v 

Here T ,LV is a tensor, called the energy-momentumflux tensor of the electromag¬ 
netic field. We’ll analyze this equation in detail, to understand the interpretation 

24 It is important to remember that the Einstein summation convention is used throughout this section. 
Greek indices run from 0 to 3, and Roman indices run from 1 to 3. 
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of T ,tv . Later we’ll write an equation for T ,LV in terms of the electromagnetic 
field F' xv . 

Equation (12.112) is a kind of continuity equation. Recall that the continuity 
equation dJ l '/dx 1 ' = 0 expresses conservation of charge, a scalar quantity. In 
the absence of charge, i.e., for J IL = 0, the equation <)T ,,V /dx v = 0 expresses 
conservation of the energy and momentum—a 4-vector quantity—of the electro¬ 
magnetic field. In the presence of charged matter, there may be transfer of energy 
and momentum between the field and the matter. Then the local conservation of 
energy and momentum is described by (12.112). 

To comprehend this equation, and leant the interpretation of T^ v , we inspect 
the temporal and spatial components of (12.112). The p = 0 component is 


j • e _ lar 00 dT 0i 

c c dt dx‘ 


(12.113) 


J • E is the rate at which work is done, per unit volume, on the charged matter at x. 
In order for (12.113) to be a statement of conservation of energy, dT°°/dt must 
be the rate of change of field energy, per unit volume, i.e., T 00 is the field energy 
density. Also, d(cT 0, )/dx' must be the divergence of the energy flux, i.e., cT°‘ 
is the Poynting vector. In other words, (12.113) is just Poynting ’s theorem, which 
we encountered in Chapter 11, expressing the local conservation of energy. 

The p = i (spatial) component of (12.112) is 


pE‘ +€ijkJ j B k 


1 dT i0 dT'i 
c dt dxJ 


(12.114) 


Equation (12.114) must express local conservation of 3-momentum. The left-hand 
side is the force per unit volume acting on the charged matter, i.e., the rate of 
change of its 3-momentum density. In order for (12.114) to be a statement of con¬ 
servation of momentum, d(T l0 /c)/dt must be the rate of change of field momen¬ 
tum density; that is, T'°/c is the field momentum density. Also, dT'j/dx-i must 
be the divergence of the momentum flux. With these interpretations for the three 
parts of (12.114), the rate of change of momentum in a small volume at x equals 
the total flux of momentum into the volume. T'J (x) is the flux (momentum per 
unit area per unit time) in the yth direction, of the ; th component of momentum. 

Now that we have identified the meaning of T^ v , we must write T 1 - 11 ’ in terms 
of the electromagnetic field F^ v . The defining equation is (12.112). As we shall 
verify, the formula for T 1 ' l ’ is 

= — \f^F'' - -g pv F pa F pa \ . (12.115) 

Mo l p 4 J 

Here g ,lv is the metric tensor: 

g C0 = - 1, g n = g 22 = g* = + 1 , and £^=0 for p^v. 

(12.116) 
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It can be shown that g ,lv transforms as a tensor, i.e., A^A v a g pa = g' lv ■ There¬ 
fore (12.115) is a covariant equation. All of our considerations of field energy 
and momentum in this section start from tensor equations, so the conclusions are 
consistent with Einstein’s postulate of relativity. 

To verify that (12.115) does satisfy the continuity equation (12.112), we note 
that the 4-space divergence of (12.115) can be written in the form 


dT» v 

dx v 


1 nn( dFV p\ 1 [BF*” dFW 

— T7 “a-T"-*“ "a — " • 

fl 0 1 \ OX v } 2 |_ ° X U- ° x a OXp J 


(12.117) 


(We have used the fact that g pv d/dx v = d/dx^.) By Maxwell’s equation (12.55) 
the quantity dF v p /dx v , which appears in the first term on the right, is equal to 
—/xo Jp. By Maxwell’s equation (12.56) the quantity in square brackets is 0. Hence 
(12.112) is satisfied. 

Now we may go back to the interpretation of T 1 ' 1 ' and write the energy and 
momentum density and flux in terms of E and B. The field energy density, denoted 
by m(x, t) in previous chapters, is 


u = T 00 = — 
Mo 


E l B l 

2 ? + T 


(12.118) 


where 7' ()() has been evaluated in terms of E and B from (12.115). The result 
agrees with our earlier analyses of field energy. The energy flux is 

S = e iC T 0i = — E x B (12.119) 

MO 

where again T°‘ has been evaluated from (12.115). The result agrees with the 
equation for the Poynting vector derived in Chapter 11. 

The electromagnetic field carries momentum as well as energy. The field mo¬ 
mentum density must be T'°/c, as we argued above. By (12.115) the energy- 
momentum flux tensor is symmetric: T vM = T^ v . Therefore T'° = T°‘ — S' /c. 
The momentum density IEx, t) in the electromagnetic field is Six, t)/c 2 . We have 
used this result previously (in Chapter 11) but without deriving it. 

The relation II = S/c 2 is consistent with the photon theory of light. In the 
photon theory, the energy flux is S = E y nck, where n is the photon density, E y 
the energy of a photon, and k the direction of photon velocity. The momentum 
density is p y n, where the single photon momentum is p y = (E y /c) k because a 
photon is a massless particle. Thus the momentum density in the photon theory 
is S/c 2 , the same as in the classical theory. 
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FURTHER READING 

1. Galileo, Dialogue on the Two World Systems. 

2. A. Einstein, The Meaning of Relativity, 5th ed. (Princeton Univ. Press, Princeton, 1955). 
[This little book is quite readable. The first half is on special relativity.] 

3. K. Krane, Modern Physics, 2nd ed. (Wiley, New York, 1996). [Chapter 2 is an intro¬ 
duction to special relativity, including a discussion of the experimental verification of 
the theory.] 

4. S. T.Thornton and A. Rex, Modern Physics, 2nd ed. (Saunders, Philadelphia, 2000). 
[Chapter 2 is on special relativity.] 

5. R. Resnick, Introduction to Special Relativity (Wiley, New York, 1968). 

6 . E. F. Taylor and J. A. Wheeler, Spacetime Physics (W. H. Freeman, San Francisco, 
1966). 

7. E. M. Purcell, Electricity and Magnetism, 2nd ed. (McGraw-Hill, New York, 1985). 
[This famous introductory book uses the axiomatic presentation of electromagnetism, 
deriving magnetism from the Lorentz transformation.] 

8 . M. Schwartz, Principles of Electrodynamics (Dover, New York, 1985). [This book, too, 
uses the axiomatic presentation. It is more advanced than Ref. [7].] 


EXERCISES 

Sec. 12.1. Coordinate Transformations 

12.1. Derive the Lorentz transformation by assuming that the transformation is linear, 
and does not change the perpendicular coordinates. Write the transformation as 

x = A j (x — vt), y = y, z = z, t' — A 2 t + A$x. 

Determine A i , Aj, A 3 by requiring that a flash of light produces an outgoing spher¬ 
ical wave, with velocity c, in either frame T ox T 1 . 

12.2. If }moves with velocity n i in T, then the Lorentz transformation relating coor¬ 
dinates is 

x' = y (x — vt), y' = y, z = z, t' = y (t — vx/c , 

where y = l/yl — u 2 /c 2 . Find the inverse of this transformation, i.e., solve for 
x,y,z,t in terms of x',y',z',t'. Verify that the inverse transformation has the 
same form as the Lorentz transformation except with the replacement u -> —u. 
This form is necessary for self-consistency since T moves with velocity — u i in 
the frame J- 1 . 

12.3. The mean lifetime of muons is 2 x 10 -6 s in their rest frame. Muons are produced 
in the upper atmosphere, as cosmic-ray secondaries. 

(a) Calculate the mean distance traveled by muons with speed 0.99 c, assuming 
classical physics, i.e., without special relativity. 

(b) Calculate the mean distance with special relativity. 
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(c) What percentage of muons produced atan altitude of 10 km reach the ground, 
assuming they travel downward with speed 0.99 c? [Answer: 9%] 

12.4. Suppose an inertial frame T' moves with velocity v i with respect to another 
frame T. The rapidity of T' is the variable 9 defined by 

tanh 9 = v/c. 

(a) Sketch a plot of 9 versus u, for —c < ii < c. Use computer graphics if you are 
unfamiliar with the hyperbolic tangent, but then explain analytically the form 
of the graph. 

(b) Write the Lorentz transformation matrix Aq) in terms of sinht? and cosh0. 
Compare the result to the transformation matrix for a coordinate rotation about 
the z axis. 

(c) Define similarly the rapidity 9 P of a particle, moving with velocity u i in T, 
by tanh 6 p = u/c. From Einstein’s velocity addition theorem, derive the trans¬ 
formation of particle rapidity from 2F' to 2F . [Answer: 9 P = 9' p + 9] 

Sec. 12.2. Minkowski Space 

12.5. Write (12.20) as a matrix equation, for the transformation matrix in (12.21), and 
verify that the result is equivalent to the Lorentz transformation (12.5). 

12.6. (a) Show that = —a°b° + a • b is invariant under Lorentz transformations, 
(b) Show thatthe invariance of a^b^ implies that b p transforms according to h' fL = 

b a (h~^) a p , where A -1 in the inverse of the matrix in (12.20). 

12.7. The set of Lorentz transformations in a given direction (“boosts”) forms a group. 
Consider boosts in the x direction. Let A^(v) denote the Lorentz transformation 
matrix for a boost of velocity u i. 

(a) Show that the product of two boosts, of velocities iq i and i >2 i , is also a boost, 
corresponding to velocity (iq + V 2 )/(l + v\V2/c 2 ). 

(b) What is the inverse of the boost of velocity v i ? 

(The group of Lorentz transformations in all directions, and rotations, is called the 
Lorentz group.) 

12.8. Sketch a spacetime diagram, i.e., the (x, ct) plane, showing: (i) the light cone (locus 
of points touched by light emitted at x = 0 at time t = 0, (ii) the trajectory of an 
observer at rest at the origin, and (iii) the trajectory of an observer who travels from 
x = 0 at t = 0 to x = L, and then back to x = 0. 

What parts of the spacetime diagram are impossible for a traveler to reach, 
leaving x = 0 at / = 0? 

12.9. Derive the transformation equations for particle velocity. Let u = dx/dt be the 
particle velocity in T , and u' = dx'/dt' that in T'. Determine u J and u' L in terms 
of u || and ui. 

12.10. An inertial frame 2F' moves in the x direction with speed n = 0.9c relative to 
another frame 2F. A traveler is moving with velocity u' = 0.5c i + 0.5c j in the 
frame T'. Determine the traveler’s velocity in the frame T. 
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12.11. (a) Let be the 4-velocity of a particle. Evaluate [Answer: —c 2 ] 

(b) Let p 11 be the 4-momentum of a particle. Evaluate p 1 ' p fl . [Answer: —m 2 c 2 ] 

12.12. Sketch a graph of E — me 2 versus particle speed u, where E is the particle energy. 
On the same graph sketch the classical kinetic energy. Explain why the relativity 
curve implies that a particle cannot accelerate to a speed greater than c. 

12.13. (a) Sketch a graph of particle energy E versus particle momentum p. What is the 

name of this mathematical curve? 

(b) How does E depend on p if p me? What is the curve in this limit? 

(c) How does E depend on p if p ;» me? What is the curve in this limit? 

Sec. 12.3. Electromagnetism in Covariant Form 

12.14. (a) Prove that d/dx 11 transforms as a covariant vector. For example, if <p(x) is a 

scalar function (x means the 4-vector x >l ) then d<p/dx 11 is a covariant vector. 
Before you try to construct the proof, check that the result makes sense, by 
considering dtp/dx^ for <p = C ■ x, where O l is a constant 4-vector, and for 
<p = (C • x) 2 , and for <p = x 2 . (Hint: Use the chain rule of differentiation: 
the derivative with respect to x' is the derivative with respect to x times the 
derivative of x with respect to x’.) 

(b) Prove that the covariant equation dJ ,l /dx ,L is the same as the continuity equa¬ 
tion V ■ J = —dp/dt. 

12.15. Verify that the temporal component (p = 0) of (12.56) is the same as V ■ B = 0, 
and the spatial components (p = i = 1,2, 3) are the same as V x E = —3B/3 1. 

12.16. (a) From the requirement that (cp, J) is a Lorentz vector J 11 , determine the 

Lorentz transformations of p and J. 

(b) Suppose in a frame T ', p' is nonzero, but J' is 0. What are p and J in another 
frame T ? Show that J = pv. Explain why p is greater than p'. 

Sec. 12.4. Field Transformations 

12.17. Prove the following theorem: If B = 0 in some inertial frame T , then B' = —v x 
E '/c 2 in any other inertial frame T '; v is the constant velocity of T' with respect 
to T. 

12.18. In 1952 Einstein wrote: “What led me more or less directly to the special theory of 
relativity was the conviction that the electromotive force acting on a body in motion 
in a magnetic field was nothing else but an electric field.” 

Suppose in an inertial frame T there is a magnetic field B = b j , but the electric 
field is 0. If a charge q moves with velocity u = u i in F, then it experiences a 
force qub k (the magnetic force). Now let T' be the rest frame of q. 

(a) Find E' and B', the fields in T'. 

(b) Find F', the force on q in T'. 

(c) How does the result relate to what Einstein wrote? 

12.19. An observer, Stephanie, measures the electric and magnetic fields of a large charged 
plate, which is at rest in the xy plane with uniform charge density 05 , in a region 
far from the edges of the plate. What are her measured fields Es and Bg? 
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Another observer, Loretta, is moving with velocity n i with respect to Stephanie, 
and Loretta also measures the electric and magnetic fields. What are her measured 
fields, El and Bl? 

Loretta attributes the fields to a surface charge density ctl and current density 
Kl- Relate these quantities to erg. 

12.20. Michele, an astronomy student, observes that the Lyman a spectral line emitted 
from hydrogen in a distant quasar has a wavelength of 790 nm, in the infrared. 
The wavelength of the Lyman a line from terrestrial hydrogen is 122 nm, in the 
ultraviolet. How fast is the quasar moving away from Earth? How did Michele 
recognize this line considering how far it has been shifted? 

Sec. 12.5. Fields of a Moving Charge 

12.21. The fields E(x, t) and B(x, t) due to a charge q moving with constant velocity 
v = ni are given in (12.88) and (12.89). Show that these functions satisfy the 
vacuum form of Maxwell’s equations, apart from the singularity at x = vt i. 

12.22. A proton at the origin exerts a force on the nucleus of a gold atom at rest on the 
z axis at z = lOOfm. (The atomic electrons are irrelevant.) Compute the force on 
the Au nucleus if the proton is (a) at rest, and (b) moving along the x axis with 
speed 0.99 c. Also, in the latter case, plot the magnitude of the force on the Au 
nucleus as a function of time. In making the plot it is convenient to measure time in 
zeptoseconds (zs) where 1 zs = 10 ~ 21 s. Express the forces in MeV/fm. (Neglect 
the motion of the heavy Au nucleus.) 

General Exercises 

12.23. A parallel plate capacitor is at rest in an inertial frame T'. The plates have area A', 
are located at z! = d/2 and z! = —d/2, parallel to the xy plane, and have charge 
— Q and + Q respectively. The capacitor moves in the x direction with speed v in 
another frame T. 

Neglecting edge effects, find the electric and magnetic fields in the frame T. 

12.24. Consider a particle (call it P) with rest mass m moving under the influence of a 
constant force F = F i . For example, P could be a proton in a constant electric 
field. The equation of motion is dp/dt = F where p = mu/^l — « 2 /c 2 . Here u 
is the velocity of P. Assume P is at rest at t = 0. Then u = u i at later times. 

(a) The solution of the equation of motion is p = Ft. Determine u as a function 
of t. Sketch a graph of u versus t. 

(b) Let r be the proper time of P. From the definition (dr = dtf 1 — w 2 /c 2 ) show 
that Ft/me = sinhfEr/mc). Sketch a plot of r versus t. 

(c) Show that.r(r) = (me 2 // 7 ) [cosh(Er/mc) — 1], 

(d) Use computer graphics to make an accurate plot of the worldline of P in the 
plane of points (x, jc°). Use me 1 /F as the unit of length. (Hint: Make a para¬ 
metric plot.) 

12.25. We showed that the Newtonian force F must be invariant under a change of iner¬ 
tial frame if the transformation between inertial frames is Galilean. But the trans¬ 
formation between inertial frames is not Galilean, but Lorentzian. So how does 



484 


Chapter 12 Electromagnetism and Relativity 


the Newtonian force F, defined to be dp/dt, transform under a change of inertial 
frame? 

(a) By transforming the equation of motion F = dp/dt to T' coordinates, i.e., 
writing F' = dp' /dt', prove that 


F« = Fit 


duj_ • Fj_ 

(1 — V ■ u/c 2 ) 


F' F± 

X y (l — v ■ u/c 2 ) 


where u is the velocity of the particle. (|| and _L refer to vector components 
parallel and perpendicular to v.) 

(b) From the Lorentz transformations of u, E and B, prove that the Lorentz force 
?(E + ux B) transforms in this manner. 

12.26. A charge q moves with constant velocity u = u i. Define the antisymmetric tensor 
r> lv by 


r* v = - (r)»x v - rj v x^) 
c 7 

where is the 4-velocity of q. Also, define r 2 = -r^r^t 2. 

(a) Prove that 


r 2 = 


(x — lit) 
1 — m 2 /c 


2 + y 2 + z 2 - 


(b) From equations (12.88) and (12.89) for E and B of the moving charge, show 
that the electromagnetic field tensor is 


pixv _ 


q r» v 
4jreoc (r 2 ) 3 / 2 


This is the electromagnetic field due to a moving charge, in covariant form. 
Such a fundamental quantity has a beautiful form when written covariantly. 
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We have learned that the field theory of electromagnetism is a unified theory in¬ 
cluding both time-independent and time-dependent phenomena of electricity and 
magnetism. As part of this study we saw in Chapter 11 that the Maxwell equations 
for E(x, t) and B(x, t) in vacuum have wave solutions. We explored the properties 
of electromagnetic waves, valid over a wide range of frequencies. In vacuum, all 
electromagnetic waves travel at speed c = 3.00 x 10 8 m/s. 

In this chapter we will apply and extend the same principles to consider elec¬ 
tromagnetic waves interacting with matter. An important part of this subject is 
the branch of physics called Optics, the study of light. Among their many in¬ 
teresting properties, light waves from the Sun bathe the Earth and support life. 
They make vision possible—the ability to see our surroundings (including this 
textbook). From the electromagnetic theory, we’ll come to understand how light 
propagates in matter, how it reflects from materials (dielectrics and conductors), 
the role of polarization, and the basis for many familiar optical phenomena. 

It’s useful to bear in mind that a lot of what we’ll study in this chapter holds 
not only for light (visible, infrared, ultraviolet) but also for other kinds of electro¬ 
magnetic waves (radio, microwaves, X rays). The behavior of the fields depends 
on the material with which the wave is interacting. The idea is that wave solutions 
in vacuum are similar for all frequencies and wavelengths, because in vacuum 
E(x, t) and B(x, t) interact only with each other. But when electromagnetic waves 
interact with matter —it’s usually the interaction between the electric field and the 
electrons that determines what happens—then atomic and molecular properties 
become important. 


13.1 ■ ELECTROMAGNETIC WAVES IN A DIELECTRIC 

Recall that by a dielectric we mean a linear material with electric permittivity 
€ and magnetic permeability /z but which has negligible conductivity, rr = 0. 
Examples of dielectrics are glass, water, plastics, and other insulators. In this sec¬ 
tion we’ll assume, for simplicity, that e and /z are real constants, independent 
of the wave frequency. The constitutive equations are D(x, t) = eE(x, t) and 
H(x, t) = B(x, ?)//z. Electromagnetic waves in dielectrics are similar in many 
ways to those in free space, except that they travel more slowly. 
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The Maxwell equations in a uniform dielectric with neither free charge (pf ree = 
0) nor free current (Jfree = 0) are 


V 


V D = 0 

and 

V • B = 0, 

(13.1) 

„ 3B 

x E =- 

3 1 

and 

V x H = —. 

3 1 

(13.2) 


These have the same form as Maxwell’s equations in vacuum, with the replace¬ 
ments eo —»■ e and fio —> i-i. Therefore (13.1) and (13.2) have transverse wave 
solutions with the same form as electromagnetic waves in vacuum. However, the 
wave speed in the dielectric is 


v = l/y/eji. 


(13.3) 


smaller than the wave speed in vacuum c = 1 /y/eo/iQ. The index of refraction of 
the dielectric, denoted by n, is defined as the ratio of wave speeds 


n = 


c 

V 


so that 


n = 



(13.4) 


(13.5) 


For common dielectrics the magnetization is small, so /u & n 0 ; and e > eo- Then 
v is less than c, and n > 1. Table 13.1 lists values of the index of refraction for 
some common dielectrics. Much of classical optics is concerned with the effects 
of the index of refraction. The fact that n is directly related to the electric and 
magnetic parameters e and n relates optics to electricity and magnetism. 

In applying (13.5) it is important to note that e for real dielectrics may de¬ 
pend on frequency. 1 The ratio e/eo must be measured at the frequency of the 
light whose index of refraction is being calculated. It is too naive to substitute 
into (13.5) the static dielectric constant k = e/eo, which we studied in Chapter 6. 


TABLE 13.1 Index of 
refraction of various 
materials for light of 
wavelength 589 nm in 
vacuum 


Air 

1.0003 

Water 

1.33 

Crown glass 

1.52 

Diamond 

2.42 


1 A theoretical model for the frequency dependence of e is described in Sec. 13.4. 
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(However, most dielectrics are approximately nonmagnetic at all frequencies, so 
in (13.5) we can set \x/jx o = 1.) The frequency dependence e = e(«) is usu¬ 
ally negligible for nonpolar gases (e.g., air or inert gases), nonpolar liquids (e.g., 
benzene), and nonpolar solids (e.g., diamond). For those substances (13.5) gives 
a good approximation to the index of refraction of light even if the static value for 
k is used. But clearly this formula does not give the correct index of refraction for 
water, if one compares the value n = 1.33 in Table 13.1 with v 7c = \/8() ~ 9 
from Table 6.3. For water, and other polar materials, the ratio e(a>)/eo is smaller 
at optical frequencies than for the static case, because the molecular dipoles in 
the material cannot keep up with the rapidly changing E field in light, so the net 
polarization is smaller. In contrast, in a microwave oven, where the frequency 
(typically 3 x 10 9 Hz) is much lower than optical frequencies (typically 6 x 10 14 
Hz) the dipoles do oscillate in phase with the field. It is the thermalization of this 
motion that cooks the food. 

The fields of a polarized plane wave in the dielectric are 
E(x, t) = E 0 e' (k ’ x_co,) 

B(x, t) = B 0 e' (k ’ x_ “ f) (13.6) 

where the real part is understood to be the physical field on the right-hand side of 
these equations. The fields in (13.6) must satisfy Maxwell’s equations in the form 
(13.1) and (13.2). Applying the divergence equations (13.1) to these fields gives 

eV • E = eik • E 0 e' (k ' x-<u,) =0 
V • B = ik ■ Boe'' (k ' x ~"° =0 (13.7) 

which imply that the amplitudes Eo and Bo are orthogonal to the wave vector k. 
Hence the electromagnetic wave is transverse. Applying Faraday’s Law (the first 
equation of (13.2)) to the fields gives 

ik x E 0 e ,(k,x -"') = ia)B 0 e' (k " x_ “ , \ (13.8) 

which relates the two field amplitudes, 


.. k x E 0 

Bo = -. 

CO 

Similarly, from Ampere’s Law (the second equation of (13.2)), 


Eq = 


—k x Bo 
pteu) 


(13.9) 


(13.10) 


Equations (13.6) to (13.10) show that electromagnetic waves in a dielectric 
have the same form as in vacuum. In both cases the vectors Eo, Bo, and k are an 
orthogonal triad, and the wave propagates in the direction of the wave vector k 
with velocity co/k. 
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One difference between electromagnetic waves in vacuum and those in a di¬ 
electric is the wave speed. In vacuum there is a unique propagation velocity c, 
and the dispersion relation is co = ck for all frequencies. The vacuum phase ve¬ 
locity Uphase = co/ k and group velocity u groU p = dco/dk are both equal to c. 
So in vacuum all electromagnetic waves, whether a monochromatic laser beam 
or a polychromatic sun beam, travel at speed c. In a real dielectric, however, the 
propagation velocity v given by (13.3) depends on the frequency because e varies 
with co. The dispersion relation has the form co = v(co)k; it follows that u p h a se 
and Ug r oup are in general different and not equal to c. So, for example, the wave 
speed of a laser beam in a dielectric depends on the laser frequency; also, a pulse 
will spread in time because u groU p / u p h a se- Or, a sun beam will be dispersed as 
it passes through the material because its constituent frequencies propagate with 
different velocities. We will discuss phase and group velocities more later. 

The intensity of a plane wave is defined as the mean energy flux. The Poynting 
vector (energy flux vector) in a linear material isS = ExH = Ex B//t. In 
applying this equation we must use the real parts of E and B. To calculate S for 
the plane wave (13.6), assume for now that Eo, Bo, and k are all real. In that case 
we have 

Eo X Bo T £n T 

S =-—- cos 2 (k • x — cot) = k —- cos 2 (k • x — cot). (13.11) 

p pco 

The second equality follows from the fact that Eo, Bo and k form an orthogonal 
triad, and Bo = kEo/co. The intensity of the plane wave is therefore 

/ = (r s ) = l ev£ °’ (13 - 12) 

where the factor 5 is the time average of cos 2 (k • x—cot ). We have derived (13.12) 
for the case of real Eq and Bq but the result is more general . 2 


13.2 ■ REFLECTION AND REFRACTION AT A DIELECTRIC INTERFACE 

The fact that the speed of light is different in different dielectrics leads to the 
important optical phenomena of reflection and refraction at a dielectric interface, 
as, for example, the reflection of the sky in the surface of a lake. The geometrical 
laws that govern reflection and refraction are a direct consequence of the physical 
boundary conditions that E and B must satisfy at the interface. First we will derive 
these laws, and later we will determine the reflectivity itself. 

Consider the wave solution illustrated in Fig. 13.1. The yz plane is the interface 
between two dielectrics, with parameters ei, p\ for x <0 and 62 , M 2 for x > 
0. The x axis is normal to the interface. In the region x < 0 the wave is the 
superposition of an incident plane wave with wave vector k and a reflected plane 
wave with wave vector k"; in the region x > 0 is a transmitted wave with wave 

2 See Exercise 3. 
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FIGURE 13.1 Wave vectors for reflection and refraction of light at the interface x = 0 
between two dielectrics. Dielectric 1 is to the lef t of the interface, where x < 0. Dielectric 2 
is to the right of the interface, where x > 0. The arrows point in the directions of the 
wave vectors k, k\ and k". (a) Light incident from an optically less dense dielectric onto 
an optically denser dielectric, n ] < nj- (b) Light incident from an optically more dense 
dielectric onto a less dense dielectric, n j > /22- 


vector k'. The electric field is 


E(x, t) 


E () e ,(k ' x - M ‘> + Ege , ' (k "-*-“"0 for x < 0 
E^'( k 'for x > 0. 


(13.13) 


The magnetic field can be determined by Faraday’s Law, as in (13.9), for each 
term. Figure 13.1 shows the rays of the three waves, which are lines in the direc¬ 
tions of the wave vectors k, k', and k". In each wave the electric and magnetic 
fields are everywhere orthogonal to the light ray. 
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Normally there is neither free surface charge nor free surface current on the 
interface. In that case the four boundary conditions that must be satisfied are 

6i£i_l = * 2 E 2 ± and Bi± = i? 2 i, (13.14) 

Em = E 211 and Bi||/Mi = ® 2 ||/M 2 . (13.15) 

where 1 and 2 indicate the two dielectric regions. The subscripts _L and || denote 
the directions perpendicular and parallel to the interface, also called the normal 
and tangential directions, respectively. 

13.2.1 ■ Wave Vectors 

The boundary conditions (13.14) and (13.15) must hold for all t. Therefore the 
incident, reflected, and transmitted waves must have the same frequency 

co = co' = co". (13.16) 

For a plane wave, co/k is equal to the wave speed c/n. The index of refraction n is 
the same for the incident and reflected waves because they are both in dielectric 1, 
so the equality co" = co implies k" = k. But the refracted wave is in dielectric 2, 
so that co' = co implies 

k' = —k. (13.17) 

n\ 

If «2 > n\ (the case illustrated in Fig. 13.1(a)) as forexample the case of a wave 
in air incident on water or glass, the wavelength of the refracted wave (2 tc /k') 
is less than that of the incident wave ( 2n/k ). Although the frequency remains 
unchanged when a light wave enters an optically denser medium, the wavelength 
and speed both decrease. 

The incident wave in Fig. 13.1 is a plane wave and therefore illuminates the en¬ 
tire interface, i.e., the yz plane (x = 0). Similarly the reflected and refracted waves 
come from the entire yz plane. The boundary conditions (13.14) and (13.15) must 
hold for all y and z at x = 0. Therefore the variation with y or z must be the same 

for all three waves. In other words, the y and z components of the three wave 

vectors (incident, reflected, and refracted) must be equal: 

k y = k' y = k”, (13.18) 

k z =k' z = k'!. (13.19) 

Suppose the wave vector of the incident wave has no z component, k z = 0. 
In that case k' z and k " are also both zero, according to (13.19). This means that if 
the incident ray lies in the xy plane then the reflected and refracted rays will also 
lie in the xy plane. In our idealized solution (13.6) the wave fronts are infinite. 
In practice, a beam from a laser might be used for demonstrations or laboratory 
experiments on reflection and refraction. The laser beam is not infinite, but its 
diameter is many wavelengths, so the plane wave approximation (13.6) is valid. 
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If the incident beam direction is in the xy plane then the reflected and refracted 
beams are in that same plane, called the plane of incidence. 

Equation (13.18) will be used to connect the angles of incidence 0, reflec¬ 
tion O' , and refraction 0". The relations are familiar but remarkable. The three 
angles, shown in Fig. 13.1, are defined as the angles between the respective wave 
vectors and the normal to the interface. If the xy plane is the plane of incidence 


then the wave vectors are 

k = n\ — cos 9 + j sin#j , (13.20) 

k' = «2~ cost/ + j sin#'j , (13.21) 

k" = «i — ^— i cos 0" + j sin0"j . (13.22) 

The conditions (13.18) then imply 

sin0 = sin0", (13.23) 

n\ sin# = « 2 sin$'. (13.24) 


Equation (13.23) is the law of reflection for specular reflection; in words, the angle 
of incidence equals the angle of reflection. This behavior of light was known to 
the ancient Greeks. Equation (13.24) is the law of refraction, or Snell’s law. 3 

Total Internal Reflection 

Figure 13.1(a) illustrates reflection and refraction for the case nz > «i • In that 
case, (13.24) implies O' < 0, which means that the incident beam is refracted 
toward the normal as shown in the figure. Figure 13.1(b) illustrates the case for 
n i > « 2 , so that O' > 0, which means that the beam is refracted away from the 
normal. A critical case occurs in Fig. 13.1(b) when O' = tz/2. For that special 
case the angle of incidence is called the critical angle 0 C , and (13.24) becomes 
n\ sin= nz, so that 


0 C = arcsin 



(13.25) 


If light is incident at an angle equal to or greater than the critical angle, 0 > 
0 C , then the incident beam is totally reflected in the sense that all the energy in 
the incident beam goes into the reflected beam. This phenomenon is called total 
internal reflection. For reflection from a glass-air interface the critical angle is 
0 C = arcsin(1.0003/1.52) = 41°. Total internal reflection can only occur for light 
incident from an optically dense material to a less dense material, i.e., from a 
negative change of n. 

3 In France, Snell's law has been called the Snell-Descartes law. Snell discovered the relation o f sines 
in 1621 but did not publish it. Descartes rediscovered the relation and published it in a book on optics 
in 1637. 
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It is interesting to ask, for the case of total internal reflection, what happens to 
the transmitted wave in (13.13). Forany angle of incidence, the transmitted wave 
can be written, by inserting (13.21) into (13.13), as 

E'(x, t) = E' 0 exp Jr ^« 2 ~ cos O'x + « 2 ~ sin£'y — cot^J J . (13.26) 

In writing (13.26) we’ve taken the plane of incidence to be the xy plane as before, 
and set to' = to by (13.16). Equation (13.26) informally correct for any angle of 
incidence. By Snell’s law, the factor «2 sin O' in the middle term in the parentheses 
can be replaced by n \ sin ft. But if the angle of incidence is greater than 0 C , then 
cos O' is purely imaginary, 


where 


cos O' = yj 1 — sin 2 O' - i%, 


(13.27) 


l = ,/4 sin 2 0-l 


I sin 2 0 
sin 2 0 C 


- 1 . 


(13.28) 


Thus the electric vector of the transmitted wave for the supercritical case 0 > 0 C 
is 


E'(x, t) = E^exp « 2 ~£*) exp |/ — sin£y — tot^j J. (13.29) 

The exponential in y and t tells us that the “transmitted” wave travels parallel 
to the interface, i.e., in the y direction, with phase velocity v p = c/(n i sin 0). 
But the exponential in x shows that as this wave penetrates into dielectric 2 (the 
less dense medium) its amplitude falls off exponentially with distance x. The 
attenuation length 5, given by 


S = ——, (13.30) 

is the distance in x over which the amplitude of E' decreases by the factor 1/e. 

For 0 < 0 C there is nothing unusual. In that case we would observe, in addition 
to the reflected wave in medium 1, a transmitted wave in medium 2 bent away 
from the normal, i.e., toward the surface. But now let 0 increase, tending to 0 C . 
As 0 approaches 0 C , the direction of the transmitted wave approaches the surface, 
and when 0 = 0 C it is parallel to the surface. For 0 > 0 C the wave in medium 2 
propagates parallel to the surface but it does not extend far into medium 2 because 
its amplitude is attenuated exponentially, as we see from (13.29). Because the 
fields decrease with distance x into dielectric 2 (the less dense dielectric), the 
electric and magnetic fields in dielectric 2 are called an evanescent wave. 

Associated with the electric field (13.29) is a magnetic field B(x, t ), given 
by (13.9). We then ask the natural question: What energy is associated with the 
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evanescent wave? The answer is surprising. There is no net energy flow across 
the boundary because the time averaged Poynting vector in the x direction is 
zero. But there is a finite flow in the y direction. This flow of energy parallel to 
the boundary is consistent with total internal reflection because in this ideal case 
the incident and reflected waves are plane waves with infinite wave fronts. 4 

If another refractive medium with index n \ is brought very close to the inter¬ 
face, say to within x < S, then a transmitted wave will appear in it. This effect is 
usually offered as support for the existence of the evanescent wave. But the argu¬ 
ment is somewhat indirect because the presence of the new medium changes the 
boundary conditions by introducing a reflected wave at the new interface. 

Near-field Scanning Optical Microscopy (NSOM) is a technique that uses 
evanescent waves to obtain optical images of surface details much smaller than 
the wavelength of light. Ordinary optical microscopy using lenses is limited, be¬ 
cause of diffraction, to resolving distances > A/2, where A is the wavelength of 
the incident light. In NSOM the evanescent waves are produced by light emitted 
from a subwavelength aperture, of diameter, say, A/5 100 nm or smaller, at 

the tip of a tapered optical fiber. There are evanescent waves in the neighborhood 
outside the tip. These waves are exponentially attenuated with distance, but if the 
probe is scanned over a surface very close to the tip, say within A/20 ~ 25 nm, 
then light is reflected from the scanned surface. Analysis of the reflected light 
reveals surface details at resolutions less than 100 nm. Among the advantages of 
NSOM is that it can be carried out on samples in air or water, which makes it 
useful for studying biological materials. 

Total internal reflection has many applications. At the glass-air interfaces of 
the prisms in binoculars, total internal reflection makes erect the inverted image 
formed by the objective lenses. Total internal reflection is also the basis for effi¬ 
ciently guiding light along optical fibers. Figure 13.2 is an illustration of an optical 
fiber; the light internally reflects at points along the glass-air interface of the fiber 



FIGURE 13.2 Illustration of an optical fiber. Light undergoes total internal reflection at 
the. glass-air interfaces. There is a small field for a short distance outside the fiber—the 
evanescent wave. 


4 See Ref. [1], 
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surface. Optical fibers of this kind are widely used in communication, where they 
make possible transmission of information at optical frequencies over long dis¬ 
tances. In medicine they are used to view inaccessible tissues and, with lasers, to 
deliver heat energy there. 

13.2.2 ■ Reflectivity for Normal Incidence 

A basic problem in the electromagnetic theory of optics is to calculate the reflec¬ 
tivity, i.e., reflected intensity, at the interface between two dielectrics, in terms 
of the electric and magnetic parameters ei, mi and e 2 , M 2 - We ask now: If light 
is incident normally on the interface, how much of the energy is reflected and 
how much is transmitted? This is the special case with 6 = 0, and therefore 
6 1 = 6" = 0 in Figs. 13.1(a) or (b). In this case then, k is directed normally 
toward the interface, and both k' and k" are directed normally away from the 
interface. 

As we know, in an electromagnetic wave E and B are perpendicular to the 
propagation direction. For normal incidence, therefore, E and B for the incident, 
reflected, and transmitted waves, are all parallel to the dielectric interface. That 
will make it simple to solve the boundary conditions (13.14) and (13.15). First, 
the conditions (13.14), on the perpendicular components, are satisfied trivially be¬ 
cause all the perpendicular field components are zero. The tangential conditions 
(13.15) would be the same for any polarization, but to be definite let the polar¬ 
ization direction be j. Then Eo = £0 j, Eg = £g j, and Eg = E g'j. Also, by 
Faraday’s Law (13.9) 5 

Ho = —z, Hg = ^_z and Hg' = —^-z. (13.31) 

Ml w l M2^2 Mi iq 

Thus the tangential continuity conditions (13.15) are 

Eo + E” = Eg, (13.32) 


Eo-Eq _ e o 

Ml VI M2f2 


(13.33) 


Here U[ = c/n\ and V 2 = c/n 2 are the wave speeds in the two dielectrics. Also, 
k" = —k for normal incidence, leading to the minus signs in (13.31) and (13.33). 
From these boundary conditions we solve for the electric field amplitudes of the 
transmitted and reflected waves. The results are 


f — 
£, 0 — 


2M2«i 


M2«l + Ml»2 


(13.34) 


E 


n _ 

0 — 


M2»1 - Ml«2 „ 
-—- E 0 . 

M2«l + Ml«2 


(13.35) 


5 In this chapter we will use the notation z for the unit vector in the z direction, rather than k which 
might be confused with the wave vectors. 
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Phase Change on Reflection 

In most dielectrics we may approximate /x = /xo . Then if n\ < n 2 , as in the 
case of light incident from air into water or glass, £ ( " and Eq have opposite signs 
because the fraction in (13.35) is negative. That is, the electric field in the reflected 
wave is 180 degrees out of phase with that in the incident wave. (But the magnetic 
fields of the reflected and incident waves are in phase.) On the other hand, if 
n\ > n 2 there is no phase change on reflection. In either case the transmitted 
wave is in phase with the incident wave. 


Reflectivity 

We tend to think of glass or water as transparent to light, but in fact these di¬ 
electrics do reflect some of the incident light. We define the reflectivity, or reflec¬ 
tion coefficient, by the ratio of reflected and incident intensities, 

/" 

R = —j~ . (13.36) 

The wave intensity I is given by (13.12). By (13.35) the reflectivity for normal 
incidence is 


R = 



f P2U 1 ~ / t ]«2 
\M2«i + tt\ni 


(13.37) 


Similarly, the transmissivity, or transmission coefficient, T is defined as I'/I; for 
normal incidence, by (13.34), 


_ 4 < U. 1 ^1«2 

e\V\El {ix 2 n\ + Pin 2 ) 2 


(13.38) 


Note that R + T = 1, which is a consequence of energy conservation: The sum 
of the energy fluxes in the reflected and transmitted waves is equal to the incident 
energy flux. 

For light incident normally from air (n\ = 1) into glass (n 2 = 1.5) the re¬ 
flectivity is R = 0.04; that is, 4% of the incident light intensity is reflected. For 
light incident from air into water (n 2 = 1.33), R = 0.02; from air into diamond 
(n 2 = 2.42), R = 0.17. We approximate /x by /xo for these materials. 

The expressions (13.37) and (13.38) are unchanged if the subscripts 1 and 2 
are interchanged. This means that the reflectivity for light incident from dielec¬ 
tric 2 into dielectric 1 is just the same as the reflectivity for light incident from 
dielectric 1 into dielectric 2. The same is true for the transmissivity. 


Nonreflecting Lens Coatings 

An important application of wave optics is to design a lens-air interface that trans¬ 
mits 100% of the light incident on it. As we will now see, this can be achieved by 
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FIGURE 13.3 A nonreflective coating. The coating extends from x = 0 to x = a and 
has index of refraction n i. The directions of propagation of the plane waves are indicated 
by arrows. The electric vectors are all taken to point in the j direction. 


coating the lens with a layer of dielectric having just the right thickness and index 
of refraction. 

Figure 13.3 shows the geometry. Monochromatic light is incident normally 
from air (no = 1) onto a dielectric layer of thickness a (index of refraction ni) 
that is on the surface of a glass lens (n 2 ). 

In the air there is an incident wave propagating in the + i direction and, in 
general, a reflected wave in the — i direction. In the layer there are waves in both 
the + i and — i directions. In the glass there is only a transmitted wave in the 
+ i direction. If we assume /x = /xo and take the direction of polarization of the 
waves to be j, then the fields in the three regions are 




E 0 }e i(kx ~ w,) + E”}e- i(kx+a)t) 

for 

x < 0 


E(x, 0 = 


E\ j e i{k]X ~ w ‘) + ix+wt) 

for 

0 < x < a 

(13.39) 



£'jg'fex-air) 

for 

x > a 


and 


Eoze i{kx ~ ait ' 1 - Eq z e - i< ~ kx + wt ) 

for 

o 

VI 

H 


cB(x, t) = 


n\E' x ze'^ x ~ w, ' > — n\E x ze~ i< ' k i x+col '> 

for 

0 < x < a 

(13.40) 



n 2 E' 2 ze i ^ x - m) 

for 

x > a. 



The wave numbers are k = co/c, k\ = n\k, and k 2 = n 2 k. The waves that 
propagate in the —x direction are written as e -'(kx+a>t) anc j e -i(k\x+uit ). t j ien a jj 
terms have the same time dependence e~ lwl . Also, we have used the transversality 
of E and B in a plane wave. 
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The boundary conditions (13.15) at the interface x = 0 between air and dielec¬ 
tric are 


Eo + Eq = E\ + E’[ 

(13.41) 

Eo — Eq = ni (E\ - E '[) , 

(13.42) 

a between dielectric and glass are 


E[e ikia + E'{e~ ik ' a = E' 2 e ikia 

(13.43) 

ni (E[e ik ' a - E'{e~ ik A = n 2 E' 2 e ik * a . 

(13.44) 


From these equations the four amplitudes E q, E \, E" and E' 2 can be found in 
terms of the incident wave amplitude Eo for any n\, n 2 and a. 


Zero reflectivity. We seek now a solution to the boundary conditions for which 
there is no reflected wave, i.e., with Eq — 0. To be more precise, we ask: What 
are the requirements on n i and a such that Eq = 0? The algebra is straightfor¬ 
ward. Set Eq — 0 in (13.41) and (13.42), and solve the resulting equations for 
E\ and E". Substitute these into (13.43) and (13.44), and form the ratio of the 
resulting equations. After some simplification the result is 

n 2 (n\ + \)e ,kia — («i — l)e~' iia n\i sin^ia -I- cos a 

n\ (m + \)e ,kia + (m — l)e~ ,kia n\cosk\a + i sin^ia 

Now, the right-hand side of (13.45) must be real (because n\ and n 2 are real) 
and there are only two possibilities; either cos k\a — 0 or sin k\a — 0. It is 
the first case that interests us now. For cos k\a — 0 the possible values of a 
are a = A]/4, 3Ai/4,..., (2 j + l)Ai/4,..., where Ai is the wavelength in the 
dielectric coating. The thinnest coating has a = Ai/4. Then by (13.45) the ratio 
n 2 /n\ must be n\, so the index of refraction of the dielectric coating must be 
n i = yfni- We have done the calculation for no = 1, but it is easy to generalize 
to no # 1. In that case the conditions for zero reflectivity are again that the layer 
thickness must be a — (2j + l)Ai/4, and that the index of refraction of the layer 
must be n i = The ideal index of refraction is the geometric mean of the 

indices on either side of the coating. 

Camera lenses are often coated with a layer of magnesium fluoride MgF 2 , for 
which ni = 1.38, therefore approximately satisfying the condition n\ — = 

\/T5. The thickness of the layer is designed to minimize reflection in the mid¬ 
dle of the range of optical wavelengths. In cameras the purpose of nonreflective 
coatings is to maximize the transmitted light. 

Stealth aircraft technology also uses the technology of nonreflective coatings. 
In this case the purpose is to minimize the reflection of radar waves in order to 
render the aircraft invisible to radar. Because the incident radar may have a variety 
of frequencies, multiple coatings are used. 
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Another way for the right-hand side of (13.45) to be real is if sin/cia = 0. This 
implies fulfillment of the two conditions a = jk\/2 and «2 = 1 (or, more gen¬ 
erally, «2 = no)- We picture, for example, two layers of air separated by a small 
thickness a of glass, or two layers of glass separated by a small thickness a of 
air. The dark bands observed for monochromatic light in Newton’s rings, or from 
light scattered by a thin wedge of air between two optically flat pieces of glass, are 
examples of zero reflectivity at locations where the air gap has thickness jk/2. In 
such cases the zero reflectivity is properly understood as the solution of the com¬ 
plete boundary value problem. It is not simply the result of interference between 
waves reflected from different surfaces, because while the reflected waves have 
phase difference of n they have different amplitudes so that the destructive inter¬ 
ference between them is only partial. To understand the complete cancellation of 
the waves requires the full solution of the boundary value problem. 

So far we have limited the discussion to light that is normally incident on the 
surface, i.e., 6=0. We next consider light incident at an arbitrary angle. 

13.2.3 ■ Reflection for Incidence at Arbitrary Angles: Fresnel's Equations 

For light incident at an arbitrary angle 6 , the intensities of the transmitted and 
reflected waves depend on the direction of polarization of the wave. Therefore 
we will consider separately the reflectivity for TE (transverse electric) and TM 
(transverse magnetic) polarization. 

TE Polarization 

The definition of TE polarization is that the electric field is perpendicular to the 
plane of incidence. The plane of incidence is the plane spanned by the normal to 
the surface and the incident wave vector k. In Fig. 13.4 the plane of incidence is 
the xy plane. The TE electric fields of the three waves—incident, transmitted, and 
reflected—are 


E = z£ 0 e ,(k ' x_ “" ) , (13.46) 

E=z E' 0 e i(k '- x -‘ o, \ (13.47) 

E" = £E^e i(k "‘ x ~ a,t) . (13.48) 

The magnitudes of the wave vectors are k = k" = co/v\ and k' = co/vj. In the TE 
case the electric fields are all parallel to the boundary, so the boundary condition 
D\x = £> 2 . 1 . is trivially satisfied. The boundary condition Eiy = E 211 implies 

£ 0 + K = £q. (13.49) 

In Fig. 13.4 the three electric fields are drawnpointing in the +z direction. The 
choice of direction +z or —z is arbitrary, because the electric vectors oscillate in 
both space and time. Also, there may be a phase change on reflection, so that 
£q has the opposite sign as Eq. In any case, it is necessary to use (13.9), i.e., 
B = //H = k x E/co, to find the magnetic field directions. From Fig. 13.4, 
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FIGURE 13.4 Eleclric fields, magnetic fields, and wave vectors for light incident on the 
interface between two dielectrics, and polarized perpendicular to the plane of incidence 
(TE polarization). The xy plane is the plane of incidence and the plane x = 0 is the 
interface, so that the E vectors are parallel to the interface. For each wave the vectors E, B, 
and k form an orthogonal triad that satisfies B = k x E/co. 


which shows the directions of E and H for each of the three waves, we find the 
amplitudes of the magnetic fields to be 

En r ~ ~ 1 

Ho =- i sin 6 — j cos 6 

AUifi L J 

Hq = —— [i sin0' — j cos O' 

V-2V2 L 

K'L r /v /v ~] 

H!j = —— i sin 9 + j cos 6 

V- 1«1 L j 

where v\ — c/n\ and V 2 = c/« 2 - From (13.46) to (13.48) and (13.50) to (13.52) 
it is straightforward to verify that E x H is in the direction of the corresponding 
wave vector for each wave. 

The i component of B is normal to the interface; the boundary condition 
B ix = B 2 x is the same as (13.49) by Snell’s law. The j component of H is parallel 
to the interface, so the boundary condition Hi n = H 211 implies Ho y + H^ y = Hq , 
i.e., 


(13.50) 

(13.51) 

(13.52) 


ni (£0- E'') 


cos 6 


n 2 E' 0 


cos O'. 


(13.53) 


AD fi 2 

Equations (13.49) and (13.53) make two equations for the two unknowns E' {) 
and E" y The solutions for these amplitudes are 
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FIGURE 13.5 Graphs of Eq/Eq, the ratio of amplitudes of the reflected to the incident 
wave, and £q/ Eq, the ratio of amplitudes of the transmitted to the incident wave, for the 
case of TE polarization. The abscissa is the angle of incidence 9. The light is taken to be 
incident on the interface from air (n j = 1) to glass (n 2 = 1.5). 


E'o 

E'o 


2 \.iin 1 cos 6 

-£0, 

in 1 cos 0 + Mi«2cos0' 

in 1 cos 6 — ii\ «2 cos O' 




(13.54) 

(13.55) 


For most dielectrics we may approximate in = 112 = /z 0 ; then the second result 
may be simpified, using Snell’s law, to 


E 


n _ 

0 - 


sin(fl ; -e) £ 

sin {O' + 0) ° 


(13.56) 


which is Fresnel’s equation for the amplitude of the reflected wave with TE po¬ 
larization. 6 

Figure 13.5 shows plots of Eq/Eq and E' 0 /Eq as functions of the angle of 
incidence 8 for TE waves incident from air into glass. The sign of Eq/Eq is 
always negative, so the reflected wave is 180° out of phase with the incident wave. 


TM Polarization 

The definition of TM polarization is that the magnetic field is perpendicular to the 
plane of incidence. The magnetic fields in terms of H = B//z are 

U = z-^-e i(k ' x - wl) , (13.57) 

P\v\ 

H' = (13.58) 

H2V2 


6 See Exercise 6. 
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FIGURE 13.6 Electric fields, magnetic fields, and wave vectors for light incident on 
the interface between two dielectrics and polarized parallel to the plane of incidence (TM 
polarization). The xy plane is the plane of incidence and the plane x = 0 is the interface, 
so that the B vectors are parallel to the interface. 


H" = 

£ E 0 e i(k"-x-<ot) 

IMV\ 

(13.59) 

Figure 13.6 shows the relations among E, H, and k, where the H fields have been 
shown pointing in the +z direction. The boundary condition H] u = H 211 implies 

Ml (p , p"\ _ ” 2 pi 

— l £ 0 + E 0 ) - —E 0 . 

Ml M2 

(13.60) 

The electric field amplitudes for TM polarization are 


Eo = £0 

— i sin # + j cos # j 

(13.61) 

Eo = £0 

— i sin O' + j cos O' j 

(13.62) 

17" _ p" 

M) ~ Eq 

— i sin # — j cos#j . 

(13.63) 


Referring to Fig. 13.6 the j component of E is parallel to the surface, so the con¬ 
dition Ei || = E 211 implies 

(£0 - Eq) cos 6 = E' 0 cos 6 '. (13.64) 

Equations (13.60) and (13.64) can be solved for the reflected and refracted ampli¬ 
tudes. The results are 


E o = 


2 /jL 2 n\ cos# 

jU.2«l COS#' + IXIH2 COS# 


Eo 


(13.65) 
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E 


n _ 

o — 


jliin2COS0 — cos#' 
fi 2 n\ cos O' + n\ri 2 cos 6 


Eq. 


(13.66) 


Again, if m = 112 = Mo then the second result may be simplified by the use of 
Snell’s law; after some algebra (see Exercise 6) 


E 


n _ 

0 — 


“ n -< 8 - e "> a 
tan (0 + O') 


(13.67) 


which is Fresnel’s equation for TM polarization. 

Figure 13.7 shows plots of Eq/Eo and E' 0 /Eo as functions of the angle of 
incidence for TM waves incident from air (n\ = 1) into glass («2 = 1-5). At 
grazing incidence (0 = 90°) all the light is reflected. For those angles for which 
Eq/Eo > 0, the reflected wave is 180° out of phase with the incident wave, but 
when Eq/Eo < 0 the reflected and incident waves are in phase. 7 

In solving this problem we used the two boundary conditions expressed in 
(13.15), namely, the continuity of H|| and of Ey. However, four boundary condi¬ 
tions, including those of (13.14), must be satisfied at the interfacial plane x = 0. 
What became of the conditions (13.14) on the normal components? The answer 
is that the continuity of B\ is automatically satisfied for TM polarization because 
is zero for all three waves. The continuity of eE\ is equivalent to (13.60) by 
Snell’s law. 

It is interesting to carry the analysis of TM polarization further by calculating 
the reflectivity R and transmissivity T. The reflectivity is again defined as the 



FIGURE 13.7 Graphs of £q/£o> the ratio of amplitudes of the reflected to the incident 
wave, and E' 0 /Eo, the ratio of amplitudes of the transmitted to the incident wave, for the 
case of TM polarization. The abscissa is the angle of incidence 9. The light is taken to be 
incident on the interface from air (n j = 1) to glass («2 = 1.5). 

7 The sign conventions for Eq of TE and TM waves, defined respectively in Figs. 13.4 and 13.6, are 
different. 
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ratio of reflected intensity to incident intensity, 8 

€\ViEq 2 /2 (ri 2 cos 0 — n\cosO') 2 
6]V]El/2 (ni cos#' + n 2 cos0) 2 ' 


(13.68) 


Similarly, the transmissivity is the ratio of transmitted and incident intensities, 

T = WlVlE = 4/7 [H 2 cos 2 0 (13 69) 

6\V\Eq/2 (/7] COS#'+ 772 COS#) 2 ' 

The conservation of energy for TM polarization (and also for TE polarization) is 
expressed by the equation 


cos 9' 

- T + R = 1, (13.70) 

COS# 

and it is straightforward to verify that this equation is satisfied. 9 

In writing (13.70) it is necessary to multiply T by the factor (cos O' / cos 0), 
which is the ratio of the cross-sectional area of the transmitted beam to that of the 
incident beam. The reason for this factor is that T is the ratio of the transmitted in¬ 
tensity to the incident intensity. Intensity is power per unit area, with units W/m 2 . 
It is not intensity, but integrated power, that is conserved. The transmitted beam 
has a different cross-sectional area from the incident beam. Figure 13.8 explains 


y 



FIGURE 13.8 Areas of the incident and transmitted beams. If the incident beam has 
cross-sectional area A;, shown at ab, then the area illuminated on the x = 0 dielectric 
interface is A j = A,- / cos 6 , shown at ac. The cross-sectional area of the transmitted beam 
is A f = A; (cos O'/ cos 6), shown at dc; note that the area of the transmitted beam is greater 
than the area of the incident beam. The angle of incidence is 6, the angle of refraction is #', 
and n\ sin# = «2 sin#'. 

8 We have set /q = H2 — A*0■ 

9 See Exercise 7. 




504 


Chapter 13 Electromagnetism and Optics 



FIGURE 13.9 The reflectivity R and transmissivity T plotted as functions of incident an¬ 
gle 9 for TM polarization. The transmissivity has been scaled by the factor (cos 9'/ cos 6) 
as explained in the text. The light is taken to be incident on the interface from air («j = 1) 
to glass ( ri 2 = 1.5). 


the geometry. If the cross-sectional area of the incident beam is A,- then the 
area that it illuminates on the dielectric interface is Ad = A, / cos 6. The cross- 
sectional area of the transmitted beam is A, = Ad cos O' = A,- (cos O '/ cos 0). If 
the transmitted beam is refracted toward the normal, as in Fig. 13.8, then A, > A,-; 
or, if the transmitted beam is refracted away from the normal, A, < A,-. In any 
case the fact that the areas are different must be factored into the comparison of 
the integrated power. It is not necessary to include a similar factor for the reflected 
wave, because the areas of the reflected and incident beams are equal. In the case 
of normal incidence, 0 = 0' = 0, the areas of all three beams are the same. 

The transmissivity and reflectivity forTM polarization are plotted in Fig. 13.9. 

Brewster's Angle 

For TM polarization, the intensity of the reflected wave is 0 if 0 + O' = rr/2, 
because then the denominator in (13.67) is infinite. (We assume /i\ = /X 2 .) The 
angle of incidence at which this condition occurs is called Brewster’s angle Or. 
By Snell’s law, with 0 = 0g and O' = rr/2 — 0g, n\ sinf^ is equal to ti 2 cos 0g\ 
therefore, 


tan 0 B = ~. (13.71) 

n\ 

If the angle of incidence of a TM wave is 0g then there is no reflected wave. The 
reflectivity tends to 0 as 0 approaches 0g. 

The condition (13.71) for Brewster’s angle, together with Snell’s Law, has a 
simple geometrical interpretation: For a TM wave with 0 = 0g, the vectors k" 
and k' are orthogonal, because 0g+0' = ir/2. (Note from Fig. 13.6 that the angle 
between k' and k" is n — 0 — O', which is zr/2 if 6> = 0g.) Or, because Eq is also 
perpendicular to k', at Brewster’s angle the polarization of the transmitted wave 
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FIGURE 13.10 Reflectivities versus angle of incidence for TE and TM polarization for 
light incident on the interface from air («i = 1) to glass («2 = 1.5). 


(Eq) is parallel to the direction of specular reflection (k")- When this occurs the 
intensity of the reflected wave is 0. 

If unpolarized light is incident on a dielectric surface at angle Op then the re¬ 
flected light is TE polarized because the amplitude of the TM reflected component 
is 0, as we have seen. The reflected light is 100% polarized at 9 = 6p, and it is 
partially polarized at other angles. Figure 13.10 shows the reflectivities for the 
two polarizations, as functions of the angle of incidence, for light incident from 
air (m = 1) into glass («2 = 1.5). In this case Brewster’s angle is 56°. The differ¬ 
ence between the two curves is a measure of the degree of polarization of reflected 
unpolarized light. This graph shows why polarized sunglasses reduce glare: When 
unpolarized light reflects from a dielectric surface, the reflected light is highly po¬ 
larized. If the sunglasses’ polarization axis is orthogonal to the polarization of the 
reflected light, then the intensity of reflected light that will pass through the sun¬ 
glasses is low. For instance, sunglasses for people who fish are designed to absorb 
horizontally polarized light, so as to reduce glare from sunlight reflected from the 
water surface. 

Grazing Incidence 

It is interesting to note from Fig. 13.10 that R —► 1 as 9 —► 7t/ 2, for either 
polarization. That is, at grazing incidence the light is 100% reflected. This agrees 
with our everyday experience of light hitting a glass surface at grazing incidence. 


13.3 ■ ELECTROMAGNETIC WAVES IN A CONDUCTOR 

In this chapter until now we’ve considered how electromagnetic waves interact 
with dielectrics. The dominant underlying physical process is the interaction be¬ 
tween E of the electromagnetic wave and electrons bound in atoms. Next we 
consider how electromagnetic waves interact with conductors, materials with free 
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electrons. The dominant interaction for this case is between E and the free elec¬ 
trons. This interaction has many interesting consequences. For example, metals 
are shiny—that’s why a silvered mirror reflects light so well. Also, light propa¬ 
gating in a metal is rapidly attenuated—that’s why metals are opaque. The speed 
of propagation of electromagnetic waves in metals is much smaller than in vac¬ 
uum or dielectrics. The magnetic field component lags behind the electric field 
for an electromagnetic wave in a conductor, and much more of the energy re¬ 
sides in the magnetic field than in the electric field. We shall explore how these 
phenomena follow from the Maxwell equations. 

In applying Maxwell’s equations to metals, we will describe the metal as a 
linear material with ohmic conductivity. The free current in the material is 

Jfree( x > 0 = oE(x, t). (13.72) 

Also, we assume the metal has linear dielectric and magnetic properties, so D = 
eE and H = B//i. These depend on the bound electrons of the metal atoms. 

Inside a metal we may set pf ree = 0, because any excess charges repel each 
other and move rapidly to the surface. We showed in Chapter 7 that the character¬ 
istic time for decay of excess charge is eo/cr. Strictly speaking we should restrict 
the discussion to frequencies less than a/e o, for which it is a justifiable approxi¬ 
mation to set Pf re e = 0. Naively, the frequency at which this criterion breaks down 
is, for Cu, about 10 18 Hz. However, a detailed analysis yields the more stringent 
limit of 10 14 Hz, which is the frequency of electron collisions in a metal. Still, we 
get a qualitatively correct picture even for optical frequencies. 

With these model simplifications, the Maxwell equations in a metal become 

V • E = 0 and V • B = 0, 

V x E = — 3B/3? and V x B = p,erE + pe3E/3r. (13.73) 

The new term, p,oE, is the conduction current, and will have interesting and im¬ 
portant consequences. 

Before we construct the wave solutions of (13.73), let’s think about this prob¬ 
lem physically, and anticipate the behavior of the solution. With Jf ree = crE there 
is ohmic resistance, and therefore dissipation of energy. The energy of the currents 
and fields is transferred to random molecular energy by heat, through the interac¬ 
tions responsible for the resistance. Consequently an electromagnetic wave will 
lose energy as it propagates through the conductor. This situation is unlike a wave 
in vacuum, or in an insulator, which propagates without loss of intensity. 

To analyze (13.73) we eliminate B by the curl curl trick : Take the curl of both 
sides of Faraday’s Law. On one side of the equation we then have V x (V xE). By 
the double curl identity from Chapter 2 this quantity is — V 2 E, because V • E = 0. 
On the other side of the equation we have —3V x B/3r, and we may replace 
V x B using the Ampere-Maxwell Law. The result is a wave equation forE(x, t). 
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It can be shown that B satisfies the same equation, by taking the curl of both 
sides of the Ampere-Maxwell equation. Now, construct a solution of (13.74) in 
the form of a plane wave propagating in the x direction 

E(x, t) =E 0 e i(KX ~‘ ot) . (13.75) 

As usual when using complex functions, the real part of the right-hand side is 
understood to be the physical field. Because V • E = 0, the constant amplitude 
vector Eo must again be transverse; that is, i • Eo = 0. The wave equation (13.74) 
requires that k and co must be related according to the dispersion relation 

k 2 =+ipoco + pea) 2 . (13.76) 

If the conductivity a were 0, as it is in dielectrics, then the dispersion relation 
would imply that k is real, and equal to the wave number k = lit/\. In metals, 
however, a ^ 0 so that k 2 and k are complex. In this case it is necessary to write 
k in terms of its real and imaginary parts, k\ and /q respectively, as 

k = k\ + ii<2. (13.77) 

Substituting into (13.76) and equating the real and imaginary terms, we obtain 
two real equations, 

k 2 — k% = pear and 2k\K 2 = pcrco. (13.78) 

These may be solved for the real and imaginary parts of k, and the result is 



where the + sign is for k\ and the — sign is for /q. The general expression (13.79) 
is rather complicated, but we shall use it only in limiting cases where it is sim¬ 
pler. The quantities k\ and K 2 are both important physically. The imaginary part 
K 2 of k leads to exponential decrease of the wave amplitude with distance x in 
the propagation direction, because if we substitute (13.77) into the original wave 
expression (13.75) it becomes 

E(x, t) = E 0 e- K2X e i(KIX - M) . (13.80) 

This exponential attenuation of the wave with x is what we anticipated by phys¬ 
ical arguments: The intensity decreases because electric resistance robs the wave 
of energy. 

Equation (13.80) shows that k\, the real part of k, determines the wavelength 
and phase velocity in the conductor. As usual, these are A. = 2jt/k\ (co) and 
Uphase = co/K\(a>), where we have emphasized the dependence of these quan¬ 
tities on co. A medium in which r P hase depends on frequency is called dispersive 
because an electromagnetic signal with several component frequencies will un¬ 
dergo dispersion as it traverses the medium; that is to say it will broaden and in 
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general change shape. The motion of waves in a dispersive medium is character¬ 
ized by the group velocity, defined by v group = d co/d k\ . (In terms of (13.79) this 
derivative is u group = (d K\/d co)~K) u p h a se is the velocity of a point of constant 
phase in the wave. u group is the velocity of the envelope of a wave pulse. 10 The 
group velocity is important because it is the velocity of information carried in a 
wave signal. 

A material is called a good conductor in the context of its interaction with an 
electromagnetic wave if the charge current (cjE) is large compared to the displace¬ 
ment current (cdE/dt). The criterion for a good conductor is therefore 

a eco. (good conductor) (13.81) 

In the case of a good conductor we may neglect the term peco 2 in (13.76) so that 
k 2 is purely imaginary. Then, from the identity \fi — (1 + i)/v2, the real and 
imaginary parts of k are equal, and given by 


«1 



Note that for a good conductor 


CK [ 
CO 



(13.82) 


(13.83) 


so ck i co. Therefore the phase velocity is small compared to c. In (13.83) we 

have assumed p = po. 

Equation (13.81) is a quantitative criterion for conduction in a material to be 
important. For example, a typical metal has a of order 10 8 m -1 . For infrared 

light the frequency co is of order 10 14 s -1 and e is of order (q — 8.85 x 10 -12 F/m. 
Thus (13.81) suggests that a metal is a good conductor for interactions with in¬ 
frared light. 

According to (13.80), a wave penetrates into a good conductor by only a very 
small distance, of order 5 = 1 //q = V 2/(paco ). This characteristic length—the 
attenuation length of E in the conductor—is called the skin depth. To estimate <5 
for infrared light in a metal, let p = p o, = 10 8 fi -1 m _1 , and co — 10 14 s -1 ; 
these imply 5 — 10 -8 m. Although this is a very small distance, corresponding to 
a thickness of, say, 20 atomic layers, the continuum approximation for the metal 
is still adequate. The result is consistent with our experience that light does not 
penetrate into a metal. Even very thin metal foils, like tinfoil or A1 foil, are opaque 
to light. 

In making the estimates of the preceding paragraph we have casually approx¬ 
imated a at high frequencies by its dc or low-frequency value. In fact a is a 
function of co, so it is important to ask: Up to how high in frequency is it rea¬ 
sonable to approximate a by its dc value? The answer to this question involves, 

10 See Exercise 12. 
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again, the electron collision time which, for Cu, is about r = 2.4 x 10 -14 s. The 
approximation we made is a good one up to frequencies for which cor = 1. For 
frequencies less than a few times 10 12 Hz, we can confidently replace rr(o>) with 
the dc value. For higher frequencies we must ultimately rely on experimental data 
for cr(co). 

The magnetic field associated with the electric field in (13.75) can be deduced 
from Faraday’s Law. The frequency of oscillation of B and E must be equal, so 
3B/3 1 is — icoB, and Faraday’s Law is 


icoB = VxE = ki x E; (13.84) 

that is, 

B=ixE 0 -e i( "“" , ‘ ) . (13.85) 

co 

We must remember that the real part of the right-hand side is understood to be 
the physical field. But k has an imaginary part, so there is a phase difference 
between the magnetic and electric field oscillations. For a good conductor, k — 
^Jpcaco/ 2(1 + i ) = s/2k\ e l7T ^ 4 . In that case the magnetic field lags the electric 
field by n/4 radians, i.e., g of a cycle. 

Taking the magnitude of both sides of (13.85), the ratio of the amplitude of the 
magnetic field to that of the electric field is 


m 

\Eo\ 



(13.86) 


For Cu and a frequency of 1 MHz, a typical AM radio frequency, this is 
|Bol/|£ol = 10 6 /c. Comparing this to the ratio of the amplitudes in vacuum, 
which is 1/c, we see that in a good conductor there is an enhancement by a factor 
of 10 6 in the magnitude of the magnetic field relative to the electric field. Also, 
the energy density of the magnetic field is much larger than that of the electric 
field; the ratio of the energy densities is 


g , 

cEq/ 2 eco 


(13.87) 


Evaluating this ratio for Cu at 1 MHz gives 10 12 . For an electromagnetic wave in 
vacuum, by comparison, the magnetic and electric energy densities are equal. The 
physical interpretation of these results is easy to understand. For a good conductor 
a small electric field drives a large electric current density, which produces a large 
magnetic field. The magnetic field of an electromagnetic wave in a metal comes 
predominantly from the charge current rather than the displacement current. 


13.3.1 ■ Reflectivity of a Good Conductor 

From daily experience we know that metal surfaces are shiny. The basis of this 
effect in classical electrodynamics, as we have already discussed, is that when 



510 


Chapter 13 Electromagnetism and Optics 


light shines on a metal the electric field of the electromagnetic wave interacts 
with the free electrons. The field exerts a force, of frequency co, on the electrons 
and causes them to accelerate. What we see as the reflected light is the combined 
radiation emitted by all the accelerating electrons. The statement that metals are 
shiny, means that the reflectivity R is close to unity for metals. 

In this section we calculate the reflectivity for normal incidence at a conducting 
surface. Consider a wave traveling in the x direction, incident from vacuum or 
air for x < 0, into a conductor for x > 0. We wish to calculate Eq/Eo, the 
ratio of the amplitudes of the reflected (Eq) and incident (£o) waves. The method 
of calculation is the same that we used earlier to calculate the reflectivity for 
normal incidence at a dielectric surface, by solving the boundary conditions of 
the fields. In fact, the algebra for the conductor problem is exactly the same as for 
the earlier dielectric problem, so we do not need to set up the problem all over 
again. We can just use the final result (13.35) with appropriate reinterpretation of 
the parameters. 11 

Region 1, the region of the incident and reflected waves, will be taken to be 
vacuum, so n\ = no and m = 1. Region 2, the region of the transmitted wave, 
is the conductor, which we assume to have negligible magnetization, so ni = 
Ho- When (13.35) is applied to a metal it is necessary to think carefully what to 
substitute for «2 in the equation. For the dielectric problem, «2 can be written 
in terms of k' by (13.17) which, if medium 1 is vacuum, gives «2 = ck!/co. 
This makes sense: It is just c/u phase- In a metal it is the complex quantity k of 
(13.75) that corresponds to k', so to apply (13.35) to this case we must make the 
substitution «2 —> ck/ ox Thus the ratio of the amplitudes is 


Eq 1 — «2 — ck 

Eq 1 + «2 CO + CK 


(13.88) 


Now, for the conductor k is a complex number k\ + iK2- The reflectivity, i.e., the 
ratio of intensities, is 1 r " 1 |2 

in (13.88); that is, 


Eq/Eq\ 2 , the squared magnitude of the complex number 


I" (CO — CK\) 2 + C 2 K% 
I (co + CK]) 2 + C 2 K 2 


(13.89) 


But for a good conductor we saw in (13.82) that k\ = K2\ and furthermore, 
ck\ J>> co, so we may treat co as a small parameter in (13.89). To first order in co. 


Acock\ [Siueo 

c 2 (k\ + *r 2 ) V er 


(13.90) 


where the second equality follows from (13.82). Thus the reflectivity of a good 
conductor is only slightly less than 1. 

Very little power flows into the conductor, to be dissipated in resistance, be¬ 
cause the Poynting vector E x H is very small at the surface. The electric field in 

n 


See Exercise 9. 
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the conductor is J/ct, which is small because a is large; so E x H is small inside 
the conductor. The electric field just outside the conductor is small because the E 
fields of the incident and reflected waves nearly cancel. E, being tangential, must 
be continuous at the surface; so if E is small on one side of the surface it must 
be small on the other side. Although it is an oversimplification to assume that the 
conductivity of a metal is a real constant in the frequency range of visible light, 
the calculation does explain qualitatively the basic reason why metals are shiny. 


13.4 ■ A CLASSICAL MODEL OF DISPERSION; THE FREQUENCY DEPENDENCE 
OF MATERIAL PROPERTIES 

In our previous calculations we assumed that e, pt, and a are simply real con¬ 
stants. For high-frequency oscillations, however, such as encountered in visible 
or ultraviolet light, this assumption is not correct physically. Dispersion, i.e., the 
frequency dependence of material properties, is an important aspect of optics. The 
fact that the index of refraction of water or glass varies with frequency explains 
rainbows, why prisms spread light into its spectral components, and chromatic 
aberration in lenses. 

Dispersion—frequency dependence—occurs because the response to change, 
by atoms or electrons in a material, is not instantaneous. For example, if the per¬ 
mittivity is a function of frequency co then the constitutive equation is not simply 
D(r) = eE(r); rather, there are time delays between a change of E and the respon¬ 
sive change of polarization, or D. 

A complete physical theory describing the response of an atom to a varying 
field requires quantum mechanics. However, we can get some insight by studying 
a classical model. Consider an electron moving under the influence of atomic 
forces and an electric field E(f) = Eoe~" ot . A simple model equation for the 
motion of the electron is 


d 2 x dx ■ , 

m —— = —Kx — y - eEoe~ lcot . (13.91) 

dt 2 dt 

Here x is the electron position, with the nucleus at the origin. The right-hand side 
has various forces on the electron; (i) — Kx represents the restoring force that 
keeps the electron in the atom, (ii) —ydx/dt is a dissipative force, which in this 
model we take to be proportional, but opposite in direction, to the electron veloc¬ 
ity; the model parameter y expresses the strength of the dissipation, (iii) —eE is 
the electric force due to the electromagnetic wave. It is understood that the real 
part of (13.91) describes the electron motion, but we use the complex equation 
during intermediate stages of the calculation for convenience. At the end of the 
calculation, Re x(t) is the position of the electron. 

The steady-state solution of (13.91) has the form x(t) = xoe~'" r . The electron 
undergoes harmonic motion with the same frequency as the driving force — eE, 
although in general there is a phase shift from the phase of the complex ampli- 
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tude X(j. It is straightforward to show that the steady-state solution is 


x(t) = 


—eE 0 

K — mco 2 — icoy 


e -iw, 


(13.92) 


To obtain (13.92) we just substitute x = xpe - '"' into (13.91) and solve forxo. 
For any specified initial condition the solution to (13.91) also includes a transient 
term, but that decays in time and (13.92) remains. 


13.4.1 ■ Dispersion in a Dielectric 

The electric dipole moment of an atom, due to the displacement from equilibrium 
of one electron, is 


p(0 = -ex(t) - aE(t), (13.93) 

assuming the dipole moment is 0 at the position of static equilibrium (x = 0). 
Using (13.92) shows that the parameter a, called the polarizability of the atom, is 


a - 


K — mco 2 — icoy 


(13.94) 


Note that a depends on co and is complex. In this microscopic calculation a is the 
polarizability of a single atom. The polarization P(x, t) is the dipole moment per 
unit volume of bulk material induced by a macroscopically averaged electric field. 
If the atomic density v is small, so that the polarization of one atom doesn’t affect 
the others, then the local field experienced by an atom is equal to the macroscopic 
field, and the polarization P due to the atomic dipole moments is P = vp = vaE. 
The permittivity e of the material is defined by eE = epE + P, so 


€ = ep + va. 


(13.95) 


Note that this permittivity is complex. 

There is an interesting correction that must be applied to (13.95) if the material 
is dense, because then the force in (13.91) must also include a contribution due 
to the other polarized atoms in the neighborhood of the electron. Under those 
conditions the local field at the atom is larger than the macroscopic average field, 
i.e., Eiocai = E + P/3ep, a result from Section 6.3.2. Then the dielectric constant 
e/ep is given by 


e 3ep + 2va 
3e 0 - va ’ 


(13.96) 


which is another way to write the Clausius-Mossotti equation (6-39). 

Now, how does an electromagnetic wave propagate in this model dielectric? 
The electric field of a plane wave, propagating in the x direction, polarized in the 
y direction, and with frequency co is 


E(x, t) = fpe'^-^j. 


(13.97) 
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The dispersion relation is the same as for a nonmagnetic dielectric, 

k = co^/ejlo. (13.98) 

But here e(co) is complex, so k(co) = k\(oj) + iK 2 {co). Therefore the electric 
field is 


E(x, t) = Eoe- K2X e i( - K ' x - wt) j. (13.99) 


This result displays both absorption and dispersion. The energy absorption length 
d is \/2k 2 ; i.e., the energy density decreases exponentially with distance, by the 
factor 1/e for each length d. The index of refraction is n = c/v = ck\/co, where 
v is the phase velocity co/k\ . Both d and n depend on frequency through the co 
dependence of a in (13.94). 

Figure 13.11 illustrates the phenomena of absorption and dispersion associated 
with the electron dynamics within this simple model. Assuming va/e o is small, 
the index of refraction is 


ac\ ve 2 ~ ft)2 ) 

n —-— 1 -)- — —-z -——-— 

co 2€oma>Q (co^ — a> 2 ) 2 + {coy/m) 2 

and the inverse absorption length is 


d~ ] = 2 k 2 = 


ve 2 co 2 y/m 

eo me (cu 2 — co 2 ) 2 + {coy/m) 2 


(13.100) 


(13.101) 


The quantity coo = - sJK/m is the resonant frequency for zero damping. The graph 
in Fig. 13.11 shows n — 1 in units of ve 2 /(2eo mcofy and d~ l in arbitrary units. 
The dissipation parameter y has been set equal to OAmcoo for illustration purposes 
only. The behavior is an example of resonance, because the absorption becomes 
strong when the driving frequency co is near the natural frequency «o- 



FIGURE 13.11 The inverse absorption length d 1 = 2k2{co) and the index of refraction 
n = ck\ {a>)/a> versus a>/a>o, where a>o = -s/K/m. 
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Below the resonant frequency coo, the index of refraction n(oj) increases 
with co. For many transparent liquids and solids the characteristic atomic fre¬ 
quencies are in the ultraviolet, so in the visible range n(u>) increases with to. 
Consequently violet light refracts more than red light in glass or water, which 
is certainly consistent with our knowledge of prisms or rainbows. At higher fre¬ 
quencies, in the region of strong absorption, there occurs anomalous dispersion, 
in which the index of refraction decreases as a function of frequency. 

Our classical model has only a single atomic frequency, but a real atom has 
many resonant frequencies, corresponding to quantum transitions between elec¬ 
tron states with frequencies A E/h. If there are no transitions in the visible part of 
the spectrum, then the material is colorless and transparent. If there are transitions 
in the visible part of the spectrum, then the absorption determines the color of the 
material. In real materials n{u >) is a complicated function. 

13.4.2 ■ Dispersion in a Plasma 

A plasma is an ionized gas consisting at least partly of free electrons and positive 
ions; it is therefore a conducting material. We have not discussed plasma prop¬ 
erties previously, although it is a fascinating subject. Plasmas are interesting for 
themselves, important for applications, and ubiquitous—sparks are plasmas, illu¬ 
minated gas tubes contain plasmas, and plasmas play a key role in fusion power 
research. Plasmas are also an important part of the Universe—the Sun and stars 
are largely plasma. We will explore the dispersion of conductivity in a plasma 
using the classical electron model. 

Because the electrons in a plasma are not bound in atoms, we set A" =0 in 
the equation of motion (13.91). The electron velocity v = dx/dt may then be 
calculated for the steady-state solution, 


v(r) = 


—icoe 

ma> 2 + icay 


E(0- 


(13.102) 


Letting v e be the electron density in the plasma, the current density is J = 
— ev e y = crE, and so the conductivity a is 


CT(ft>) = 


i(oe 2 v e 
mco 2 + icoy 


(13.103) 


Again the material parameter is a complex number. We have previously ob¬ 
tained the wave solution of Maxwell’s equations in a conducting medium. The 
fields E(x, t) and B(x, t) are given by (13.75) and (13.85), with wave depen¬ 
dence e dKx-a>t) -phe dispersion relation for k(co) is (13.76). 

For a dilute plasma, the electron collisions that produce dissipation of electron 
energy are rare, so the dissipation is small. In this case we may set y = 0. (Also, 
the polarization and magnetization are negligible for a dilute plasma, so p = po 
and € = to-) Then the conductivity a is purely imaginary 
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ie 2 v e 

a{co) = -(13.104) 

mco 

Equation (13.104) is a remarkable relation because a in other contexts is so easily 
understood and intuitive. What does it mean that a is purely imaginary ’? Recall 
that the power per unit volume lost to ohmic heating is 

■777 = J • E. (13.105) 

dv 

The real parts of J and E, i.e., the physical fields, must be used in this relation. 
But since a is imaginary, J and E oscillate 90° out of phase and the time average 
of J • E is 0. That is, there is no energy loss in the dilute plasma. This is not 
really surprising, because we set the only dissipation in the model—the damping 
parameter y —equal to 0. 

The dispersion relation (13.76) becomes 


k 1 = p-o^oco 2 


P-pe 2 v e 

m 



(13.106) 


In the second equality we have defined co p , which is called the plasma frequency, 

by 


(13.107) 

If Co is greater than <w p , then the wave vector k is real, so the wave propagates 
without attenuation through the plasma. There is dispersion, because the phase 

velocity co/k depends on frequency, u p hase = c/y l — (<w p /<w) 2 ; but there is no 
attenuation. An interesting feature for co > <w p is that the phase velocity is greater 
than c. n The group velocity is less than c. 13 

On the other hand, if co is less than <w p , then k is purely imaginary, so the wave 
does not propagate at all; its amplitude just dies out exponentially with decay 
length d = c/^jco 2 — co 2 . But if waves with co < co p cannot propagate in the 
plasma, what happens if a low-frequency wave from vacuum impinges upon a 
plasma boundary? Energy is conserved, but there is no dissipation in the plasma, 
and no propagation into the plasma; so the wave can only be reflected back into the 
vacuum. When the plasma is disturbed by a low-frequency field, charges move to 
screen out the field. If co < co p then the electrons can move fast enough to prevent 
the field from penetrating far into the plasma, and so the electromagnetic wave 
reflects from the boundary. 

12 In Chapter 14 we’ll see another example where the phase velocity is greater than c, for an electro¬ 
magnetic wave in a wave guide, and discuss the implications. 

13 See Exercise 18. 
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Radio and the Ionosphere 

An important application of the interaction between plasmas and electromagnetic 
waves is to radio propagation. The upper atmosphere is a plasma, called the iono¬ 
sphere, with a typical number density of electrons v e = 10 11 free electrons/m 3 . 
These electrons come from atoms that have been ionized by ultraviolet radiation 
from the sun. There is an equal charge density of positive ions but their motion 
can be neglected because their mass is much larger than the electron mass. Ac¬ 
cording to (13.107), the plasma frequency for the ionosphere is co p % 2 x 10 7 s -1 , 
which corresponds to / p % 3 x 10 6 Hz. The decay length in the ionosphere for 
low frequency waves, those with oj <<C o> p , is c/co p ~ 15 m. 

AM radio waves have frequencies from 0.55 to 1.60 MHz, below the plasma 
frequency / p , and they therefore reflect from the ionosphere. The reason an AM 
signal may be received at distances of hundreds of miles from the transmitter is 
that the radio waves can bounce back and forth multiple times from the ionosphere 
and the Earth’s surface. For AM and shortwave radio propagation the ionosphere 
acts as a global reflecting layer. 

The existence of the ionosphere was predicted independently by Heaviside 
and Kennedy, in 1901, from the observation that Marconi’s radio signals reached 
Nova Scotia from London with far greater intensity than expected. Indeed some 
theorists had claimed that radio signals could not be received at such large dis¬ 
tances because of the curvature of the Earth. For many years people referred to 
the ionosphere as the Heaviside layer. 14 Today it is known that the ionosphere has 
a complex and dynamic structure with several layers. The reason why the range 
of an AM radio station becomes extended at night, as is commonly observed, 
is that the outer atmosphere cools after sunset and the warmer ionosphere rises. 
Therefore radio waves that reflect from the bottom of the ionosphere come back 
to Earth farther away than during the day. 

FM radio (~ 10 8 Hz) and TV (10 8 -10 9 Hz) have co > co p , so these signals 
pass through the ionosphere without being reflected back. FM and TV signals 
cannot be received from a transmitter far beyond the horizon, because without 
ionospheric reflections, the receiving antenna must be within the line of sight of 
the transmitting antenna. Also, signals at high frequencies (/ > 10 8 Hz) must be 
used for communication between Earth and satellites, because these signals can 
pass through the ionosphere. 


FURTHER READING 

1. M. Bom and E. Wolf, Principles of Optics, 7th expanded edition (Cambridge Univer¬ 
sity Press, New York, 1999). This is a classic book on the electromagnetic theory of 
propagation, interference, and diffraction of light. 

14 OUver Heaviside (1850-1925) was a brilliant mathematical physicist, though eccentric. He had only 
an elementary formal education and beyond that was largely self-taught. He did extensive research on 
field theory, and published a three-volume work entitled “Electromagnetic Theory,” not unlike this 
book but longer and more original. Another scientist complained to Heaviside that his papers were 
very difficult to read. His retort: "That may well be but they were much more difficult to write.” 
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EXERCISES 

Sec. 13.1. Electromagnetic Waves in a Dielectric 

13.1. Starting with Maxwell’s equations in a linear dielectric, and equation (13.6) for the 
fields of an electromagnetic wave: 

(a) Prove that Eg and Bg are perpendicular to k. 

(b) Prove that Bq is perpendicular to Eg. 

(c) Prove that Bq/Eq = k/at and Bq/Eq = /xeco/k. 

(d) Prove the dispersion relation co = vk and determine v. 

13.2. (a) Show that the mean energy density of the plane wave (13.6), averaged over a 

period of oscillation, is ( u ) = eE^/2. 

(b) Show that the intensity (energy flux) is / = (u)v, and explain why this means 
that the energy flows with velocity v. 

13.3. Suppose the complex wave amplitude in (13.6) is Eg — e' ® E, where <f> and E, 
are real. What is the magnetic wave amplitude Bg? Determine the intensity for this 
electromagnetic wave, and show that it is the same as (13.12). Explain the physical 
meaning of complex Eg and Bg. 

Sec. 13.2. Reflection and Refraction at a Dielectric Surface 

13.4. Imagine a lucite prism (n = 1.5) whose cross section is a quarter circle of radius a. 
As shown in Fig. 13.12, one flat side rests on a table, and light is incident normal 
to the other flat side. The region from P to Q on the table is not illuminated by light 
from the prism. Determine the position of Q. [Answer: OQ = 3a/x/5] 


i 

i 

i 



FIGURE 13.12 Exercise 4. Light incident on a lucite prism whose cross section is a 
quarter circle. 


13.5. Consider light Iraveling from x = — oo, incident normally on a plate of glass with 
thickness a. The plate is parallel to the yz plane, with one face at x = 0 and the 
other at x = a. The index of refraction is ng = 1 for x <0 and x > a. and n = 1.5 
for 0 < x < a. The electromagnetic field in the region x < 0 is a superposition 
of right and left traveling waves (where right means i and left means — i), which 
are the incident and reflected waves. In the region 0 < x < a there are both right 
and left traveling waves, and in the region x > a there is only the transmitted right 
traveling wave. 
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(a) Write E(x, t) and BO, t) in the three regions, letting j be the polarization 
direction. Write the four boundary conditions on the wave amplitudes. 

(b) Solve for the transmission coefficient T, i.e., the ratio of transmitted intensity 
to incident intensity. 

(c) Plot T as a function of ka, where k is the incident wave vector. 

13.6. (a) Derive Fresnel’s equation (13.56) for the reflected wave amplitude of TE po¬ 

larized light, from (13.55) and Snell’s law. 

(b) Do the same for (13.67), TM polarization, by deriving it from (13.66) and 
Snell’s law. 

13.7. From the amplitudes Eq, E q, and E' f ' for light scattering from a dielectric surface, 
show that the conservation of energy equation (13.70) holds at arbitrary angles for 
(a) TE polarization and (b) TM polarization. 

13.8. Brewster’s angle. Consider light incident on the plane interface between two di¬ 
electrics, from index of refraction n ] to nj, at angle of incidence 6. For incident 
TM polarized light the reflection is 0 at 6 = <9g, where tan= nj/n \. (The 
dielectrics are assumed to have /xj = /X 2 = M0-) Therefore, for unpolarized light 
incident at 6 = 6g the reflected light is TE polarized. 

(a) Prove that for a TM polarized incident wave, the electric field of the transmitted 
wave is parallel to the direction of the reflected ray if 6 = 6g. (The direction 
of the reflected ray is well-defined, although the intensity of the reflected wave 
is 0.) 


The Rainbow Caustic 



FIGURE 13.13 Illustration of how light refraction and reflection in a water droplet 
causes a rainbow. Light incident from the left is refracted on entering the spherical drop, 
reflects partially from the back surface, and is refracted on leaving the front surface. (Only 
rays incident above the midplane are shown.) The caustic—the region where the exiting 
rays are most concentrated—is the rainbow. The colors of the rainbow are the result of 
dispersion; the angle of the caustic varies with wavelength. 
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(b) Calculate Brewster’s angle for light incident from air into water. 

(c) Calculate Brewster’s angle for light incident from water into air. 

An example of polarization by reflection is the light in a rainbow. Rainbow 
light is produced by a three-step process of light scattering from a spherical drop of 
water: refraction into the drop, internal reflection at the back surface, and refraction 
back out of the drop. Figure 13.13 illustrates the paths of the rays. At the internal 
reflection of the caustic, the angle of incidence is approximately 39°. Because this 
angle is close to Brewster's angle for water, rainbow light is highly polarized— 
about 96% polarized. 


Sec. 13.3. Electromagnetic Waves in Conductors 

13.9. (a) From first principles, set up the boundary conditions for a plane wave incident 
normally on the surface of a conductor. Let the incident wave travel in the x di¬ 
rection, and be polarized in the y direction. The reflected and transmitted waves 
travel in the — x and +x directions, respectively. The wave vector k in the con¬ 
ductor is complex, and the dispersion relation is (13.76). Solve the boundary 
conditions, and show that the amplitude of the reflected wave is correctly given 
by (13.88). 

(b) For a good conductor, i.e., a S> ea>, derive (13.90). 

(c) Show that for a good conductor, at the surface the field of the reflected wave is 
approximately equal but opposite to the field of the incident wave. 

13.10. In this chapter we have discussed reflection and transmission for dielectrics, in 
which o/eco & 0, and for conductors, in which o/eco ;» 1. But many interesting 
systems are intermediate between these extremes. In the current exercise, it is a 
fairly good approximation to use the dielectric expressions for R and 7\ 15 

For biological tissues with high water content, including skin, e and a depend 
on frequency. At 2450 MHz their values are e = 47eo and a = 2.21 fi~'m _1 . 
What is the value of cr/e&> for this system? Evaluate /q from (13.79) to estimate 
the attenuation length. 

If a radar signal at 2450 MHz impinges on skin and other similar tissues, from 
air, how much of the incident power is absorbed? Assume the tissue thickness is 
15 cm. (The precise answer is T = 0.43.) 

13.11. Calculate the radiation pressure exerted by light incident normally on the surface of 
a good conductor. (Hint: The force is equal to the rate of change of momentum of 
the light.) Express the result in terms of the intensity of the incident light. Explain 
qualitatively how the pressure results from the Lorentz force on the electrons in the 
conductor. 

13.12. As an example of group velocity, consider a Gaussian wave pulse of some quantity 
4> (x, t) that undergoes wave motion, given by 

/ OO ,/L 

e Wcx-<ot) f(k) d* 

-oo 27r 

15 The general case is treated in detail in U.S. Inan and A.S. Inan, Engineering Electromagnetism 
(Addison-Wesley, Menlo Park, California, 1998). 
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where f(k) = foe~^ k ~ k °^ a . The k range is (—oo, oo), but the integrand is 
peaked at k = k$, and the width of the peak is of order 1 /a. Assume that within 
the peak a>(k) may be approximated by 

co(k) = co(ko) + (k - ko)aj'(ko). 

(a) Evaluate the integral, and obtain explicitly the function rp(x, t). (Hint: Let k — 
ko + q, change the variable of integration to q, and use a table of integrals 
or an analytic computer program (e.g., Mathematica or Maple) to evaluate the 
integral.) 

(b) Show explicitly that the phase velocity is a>{ko)/ko and the group velocity 
is u>'(k 0 ). 

13.13. The dispersion relation for deep water gravity waves is a> = ^[gk. Show that the 
group velocity is one-half the phase velocity. (This leads to an interesting effect 
that may be observed in water waves produced by dropping a pebble into a pond. 
The outgoing wave group travels slower than the individual ripples. Wave crests in 
the group are bom at the back, travel through the group at twice the group velocity, 
reach maximum amplitude near the middle, and die out at the front.) 

13.14. Prove and explain why \Ti = (1 +i)/Vl. Also, why is the other root (—(1 +0/V2) 
not used in calculating tq and ki in (13.82)? 

Sec. 13.4. Dispersion 

13.15. Typical laboratory plasmas have number densities of 10 18 — 10 22 electrons/m 3 . 
What is the corresponding range of plasma frequencies? What is the correspond¬ 
ing range of attenuation lengths for low frequency electromagnetic waves? (Low 
frequency means much less than the plasma frequency.) 

13.16. As a simple model, the conduction electrons in a metal may be considered to be a 
plasma (together with the ion cores that make the plasma neutral). 

(a) Calculate the plasma frequency in copper, assuming one conduction electron 
per atom. The density of copper is 8.93 x 10 3 kg/m 3 . 

(b) According to this model, for what wavelengths is copper transparent to elec¬ 
tromagnetic waves? To what part of the electromagnetic spectrum do these 
wavelengths belong? 

13.17. In their interaction with electromagnetic radiation, the alkali metals behave approx¬ 
imately like plasmas. 

(a) What is the plasma frequency for Na? 

(b) Sodium is transparent for X < 210 nm. Show that this agrees with the answer 
to (a). 

13.18. For an electromagnetic wave in a dilute plasma, sketch a graph showing both the 
phase velocity and the group velocity as functions of frequency a>. 

General Exercises 

13.19. Consider a model of the conductivity of a plasma in which the conductivity is 
purely imaginary cr = iaj', that is, J = ioj E in the complex phase representation 
of the wave. 
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(a) From Maxwell’s equations, derive the relation between k and to for an electro¬ 
magnetic wave propagating in the plasma, with fields given by 

E = EQ]e i(KX - m \ 

B = B 0 ke i(KX ~ u>,) . 

(b) Show that J lags behind E by 90 degrees. 

13.20. What are the reflectivity and transmissivity for AM radio waves, frequency 1 MHz, 
incident normally on the surface of a lake? (Assume water is a dielectric with 
6/(?0 = 81 at this frequency.) 

Theoretical physics often uses simple models to explain physical phenomena, at 
least qualitatively. The next two problems concern the classical electron theory 
that we used to estimate the frequency dependence of electromagnetic parameters. 

13.21. Using the classical electron theory we derived a formula for the polarizability of a 
molecule 


, , n e e 1 lm n e e 2 

ot(oj) = -=---«- j. 

a>Q — a> z — i{y/m)co mco g 

This is essentially (13.94) but we have extended it by calling n e the number of 
electrons that contribute to the dipole moment of the molecule, and in the approxi¬ 
mate form we have assumed a> <g coo- Here coo = <JK/m is the natural frequency, 
assumed to be the same for all electrons. This model should give a reasonable es¬ 
timate of the dielectric constant k (= e/eo) according to the Clausius-Mossotti 
relation 


k — 1 Na 
k + 2 3eq ’ 


where N is the molecular density. 

(a) Estimate hco o using this model for water (k = 1.8 for visible light). 

(b) From what you know about atomic and molecular physics, does your result 
make sense? 

13.22. Using the classical electron theory we derived a formula for the (complex) conduc¬ 
tivity of a plasma 


v e e 2 lm 

°(°>) = 7-7—r-— = , 7 7 . . 

(y/m) — ico (y/m) — tco 

where co p is the plasma frequency. For a dilute plasma the damping factor y is 
small, so for the purposes of illustration assume y = 0.01ma> p . Consider a typical 
laboratory plasma, with v e = 10 15 cm 3 . 

(a) For what frequencies is Reu 3> Inter? Describe in words the behavior of an 
electromagnetic wave in the plasma if the frequency is in this range. 

(b) For what frequencies is Rea <3C Im ol Describe in words the behavior of an 
electromagnetic wave in the plasma if the frequency is in this range. 
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13 . 23 . Figure 13.13 shows the light scattering process that creates the primary rainbow. 
Light rays at varying impact parameter refract into a spherical water drop, reflect 
from the back surface, and refract out of the drop. (In the figure, only the rays 
entering the upper half of the drop are shown. The rays shown are the rays that 
would reach the ground.) At a scattering angle of about 42° there is a concentration 
of scattered rays, called the caustic; and that somewhat more intense scattered light 
is the rainbow. 

(a) Explain why the ordering of colors (ROYGBIV) is red at the outer edge of the 
arc, and violet at the inner edge. 

(b) A secondary rainbow, in which the order of colors is reversed, is sometimes 
visible at a higher angle than the primary. Explain this second arc. 

(c) Explain why the area inside the primary rainbow is brighter that the area out¬ 
side. 

13 . 24 . The criterion (13.81) f or a good conductor depends on frequency as well as conduc¬ 
tivity. If the frequency is low enough, matter that we intuitively consider insulating 
can behave as conducting. This exercise uses that idea. 

(a) What are the skin depths of 10 kHz electromagnetic waves for: dry earth 
(e = 3fQ, a = 10~ 4 S/m), wet earth (e = IOsq. o = 10 -2 S/m), and sea water 
(e = 81eo, a = 4 S/m)? Frequencies in this range are used for naval commu¬ 
nication. 

(b) What are the reflectivities of these materials for 10 kHz waves? 

Computer Exercises 

13 . 25 . Use computer graphics to plot the reflection and transmission amplitudes, £q/£o 
and £q/ Eq respectively, as functions of the angle of incidence 6, for TM polarized 
light incident from air into diamond. (Transverse magnetic (TM) polarization has 
B perpendicular, and therefore E parallel, to the plane of incidence.) The index of 
refraction of diamond is 2.42. 

13 . 26 . Calculate the reflectivity R and the transmissivity T for TE waves obliquely inci¬ 
dent on the interface between two dielectrics. 

Use computer graphics to plot R and T as functions of the angle of incidence 6 
for the case n\ = 1 and m = 1.5. 

13 . 27 . Consider the boundary conditions (13.41) and (13.44) for light incident normally 
on a dielectric coating on glass. The reflectivity is £ = £q/£q| 2 . 

Plot £ as a function of k\a in the range from 0 to 2jt , for various values of . 
(Let ni = 1.5 for glass.) Note that n\ = 1 and n\ = no are both equivalent to 
no coating. For what value of n \ is it possible to have £ = 0? What happens if 
n\ > 
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Wave Guides and 
Transmission Lines 


Microwaves are electromagnetic waves with frequencies from 300 MHz to 
300 GHz. The range of wavelengths is 1 mm to 1 m. This part of the electromag¬ 
netic spectrum is very important in modem electrical engineering. The ability to 
generate microwaves, and to control their direction and intensity, has created new 
technologies that we use every day, in some cases without even being aware of it. 

The first use of microwaves was for radar. Efficient microwave generators— 
the magnetron and the klystron—were developed during World War II for military 
radar. 1 Today radar has many commercial applications, including air traffic con¬ 
trol, weather observation, and enforcement of speed limits. Detecting an object 
by scattering of waves—the basic idea of radar—requires that the wavelength be 
smaller, or at most comparable, to the size of the object. Microwaves have the 
wavelengths appropriate for radar. 

Another significant application of microwaves is in communications. Sending 
large amounts of information rapidly and economically over large distances re¬ 
quires many separate channels. So, for example, in 1981 the cellular phone system 
in North America was established by the Federal Communications Commission 
to operate in the microwave frequency ranges 824-849 MHz and 869-894 MHz. 
The cellular system has grown so that now there are more than 30 x 10 6 portable 
telephones, the latest operating in the GHZ range, over 20 x 10 3 cell sites in 
the United States, and many more worldwide. As another example, microwave 
links between antennas on high towers carry television or telephone signals over 
thousands of miles. (The UHF channels of commercial television are in the mi¬ 
crowave range 470-890 MHz.) Microwaves are not reflected by the ionosphere, so 
each tower in the microwave network must be in the line-of-sight of its neighbors. 
The concept of microwave links has been extended by using satellites orbiting the 
Earth as the repeater stations. 

A variety of other applications of microwaves include research in atomic and 
molecular physics (electron spin resonance), the use of microwave cavities in 
high-energy charged-particle accelerators, and microwave ovens (2.45 GHz). On 
a cosmic scale, the radiation remnant of the big bang is a blackbody spectrum at 
temperature 3 K, which is maximum in the microwave range. 

1 Much of the early development of radar was done at the M. I. T. Radiation Laboratory. “It’s simple,” 
[I.I.] Rabi told the theorists who were staring at the disassembled parts of the [magnetron] tube. “It’s 
just akind of whistle.” “Okay, Rabi,” Edward Condon responded, “how does a whistle work?” (History 
of Physics Collection, Niels Bohr Library, American Institute of Physics) 
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An essential component of microwave circuits is the wave guide —a hollow 
metal tube with, most simply, a rectangular cross section—in which the electro¬ 
magnetic waves propagate with little attenuation from one part of the device to 
another. For example, in a radar station microwaves generated in a resonant cav¬ 
ity travel through a wave guide to a hom antenna from which they emerge into 
free space to be scattered by the detected objects. 

The propagation of electromagnetic waves in a finite volume containing either 
vacuum or loss-free dielectric, and bounded by conductors, is thus an important 
problem in electromagnetic field theory. The propagation is rather different from 
propagation in free space, because the fields are affected by charge and current 
distributions induced in the conducting walls. Heuristically we can picture it this 
way: A beam of radiation sent into the end of a wave guide will reflect from any 
conducting wall that it hits, and so it will travel down the guide bouncing back and 
forth between opposite walls. Physically, the fields in the guide and the charges 
in the walls influence each other and so affect the propagation. Mathematically, 
the propagation characteristics are determined by partial differential equations and 
boundary conditions, the whole set of equations being satisfied in a self-consistent 
way. 

We’ll analyze several basic examples involving wave motion in bounded vol¬ 
umes. We’ll find that for a given frequency there are a finite number of discrete 
modes of propagation, the number depending on the frequency; and there is dis¬ 
persion, i.e., the group velocity is different from the phase velocity. 


14.1 ■ ELECTROMAGNETIC WAVES BETWEEN PARALLEL 
CONDUCTING PLANES 

Our first example is a very idealized case—propagation of waves in the space 
between infinite parallel planes. Although not realistic, 2 this simple model will 
provide some useful insights, which will later help us understand the realistic 
problem of a rectangular wave guide. 

Figure 14.1 shows the geometry of this example. The planes y = 0 and y = b 
are boundaries of perfect conductors in the regions y < 0 and y > b. The re¬ 
gion between the planes 0 < y < b is vacuum, or some weak dielectric like air 
for which the polarization is negligible. The question is, how do electromagnetic 
waves propagate in this bounded space? We shall solve the field equations and 
boundary conditions for harmonic waves propagating in the +z direction. (Be¬ 
cause the planes are infinite the propagation is the same in any direction orthogo¬ 
nal to the y axis.) For simplicity we shall only consider waves that are translation 
invariant in the x direction, a further idealization. 

By a perfect conductor we mean that the conductivity of the material is infinite. 
Then the skin depth is 0, and the electric and magnetic fields do not penetrate into 
the material. Another way to understand why there are no fields in the material 

2 This “parallel plane transmission line” does resemble in some respects a device in microwave engi¬ 
neering called a microstrip transmission line. 
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FIGURE 14.1 Parallel planar boundaries of a conducting material. For y < 0 and 
y > b the conductivity is infinite, and for 0 < y < b the conductivity is 0. 


is to recall that E = 0 inside a perfect conductor. It then follows from Faraday’s 
law that <36/3? = 0, so any magnetic field must be constant in time. We set 
B = 0 initially, so it remains 0. We will use this simple fact several times in the 
discussions in this chapter. 

The normal component of B and the tangential components of E, being con¬ 
tinuous at any surface, must be 0 at the planes y = 0 and y = b. Also, there will 
be surface charge a and surface current K on the boundary planes, induced by the 
fields. Approximating a metal surface by a perfect conductor gives a reasonable 
description of the fields and surface charge distributions, but it cannot be used to 
determine the ohmic loss in the walls. 

A harmonic wave propagating in the z direction and uniform in the x direction 
has fields of the form 3 

E(x, t) = [i£^(y) + j E y {y) + iE z (y)] e i(kz ~ a,) (14.1) 

B(x, t ) = [i B x (y) + j B,00 + iB z (y)] e i(k ^\ (14.2) 

As usual when using complex functions, the real part of (14.1) or (14.2) is un¬ 
derstood to be the physical field. We could insert these forms into the Maxwell 
equations and deduce the general solutions, but it will be simpler to anticipate 
some properties of the basic solutions, and pick specific cases of (14.1) and (14.2). 
The boundary conditions of the fields will place additional constraints on the so¬ 
lutions. Indeed we will find that the boundaries have significant and interesting 
effects on wave propagation. 

In an infinite vacuum the plane electromagnetic wave is transverse in both E 
and B. Such a wave, for which E and B are both orthogonal to the direction of 
propagation, is called a TEM wave (“transverse electric and magnetic”). A TEM 
wave propagating in the z direction has E z = 0 and B z = 0. In a bounded volume 
there are also solutions for which only one of E z and B z is 0. Such solutions with 
E z = 0 are called TE waves (“transverse electric”), and those with B z = 0 are 
called TM waves (“transverse magnetic”). Superpositions of any of these waves 
are also solutions. Furthermore, the harmonic waves are complete: any solution 

3 In this chapter we use z for the unit vector in the z direction. 




526 


Chapter 14 Wave Guides and Transmission Lines 


can be written as a superposition of harmonic waves. For Fig. 14.1 we shall begin 
by deriving the TEM solution, which is rather simple, and then analyze the more 
complicated TE and TM waves. 

14.1.1 ■ The TEM Solution 

There exists a solution of the field equations for which E x = E z = 0 in (14.1). 
As we shall see, this solution is a TEM wave. Gauss’s Law for this case is V • E = 
dEy/dy =0, so E y is a constant E o; the electric field between the planes is 

E(x,r) = ]E oe i(kz ~ M) . (14.3) 

We can determine the magnetic field from Faraday’s Law, V x E = —3B/3 1. The 
frequency of oscillation of B must be the same as E, so —3B/3t = icoB\ thus 

B(x, t) = —V x E = -\ — e i(kz ~ w, \ (14.4) 

CO CO 

The Ampere-Maxwell Law c 2 V x B = 3E/3 1 must also be satisfied; a short 
calculation shows that this field equation requires 


Co = ck. 


(14.5) 


(Alternatively we could arrive at (14.5) from the wave equation.) Thus the wave 
propagates in the z direction with speed c, for any frequency. The real parts of 
the complex fields in (14.3) and (14.4) are the physical fields. As the electric and 
magnetic fields at a fixed point oscillate in time, their oscillations remain perfectly 
in phase. 

The TEM wave resembles a plane wave in free space, but chopped off below 
y = 0 and above y = b. The fields are illustrated in Fig. 14.2. The wave fronts 
are planar strips orthogonal to the direction of propagation (z) with 0 < y < b. At 
each point, E, B, and z form an orthogonal triad, and |B| = |E|/c. The Poynting 
vector is 


1 

S(x, t) = —E x B = 
MO 


z—— cos 2 (kz — cot), 
MO c 


(14.6) 


so energy flows only in the direction of propagation, with intensity E^/(2p.oc). 

However, there is another aspect of this problem—the charge and current on 
the boundary surfaces. Their densities, cr and K respectively, can be calculated 
from general boundary conditions. The tangential components of E are 0 at the 
surfaces y = 0 and y = b, in accord with the continuity of E t because E = 0 in 
the perfect conductor. However, the normal component E n is discontinuous, and 
its discontinuity is rr/eo; thus the surface charge densities are 


cr = eoEy = €qEq cos(kz — cot) at y = 0; 
a = —eo E y = — eo£o cos(kz — cot) at y = b. 
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K ir/k 2 iv/k 


(b) 

FIGURE 14.2 Snapshot of the TEM mode, for waves between parallel conducting 
planes. The diagram shows schematically (a) the electric field and charge density, and 
(b) the magnetic field and current density, at an instant of time. A dot indicates a vector 
pointing out of the page, and a cross, one into the page. The charges and currents reside on 
the boundary surfaces at y = 0 and y = b. The fields and densities move in the z direction 
as time passes. 


The normal component of B is 0 at y =0 and b, in accord with the continuity 
of B n . However, the tangential component B t is discontinuous, corresponding to 
a surface current density K = n x B/^o — iK z with 

B x Eo 

K z — -= H-cos (kz — cot) at y = 0; 

Mo Mo c 

B x Eo 

K z — H-=-cos (kz — cot) at y = b. 

Mo MO c 

Figure 14.2 is a schematic diagram of the fields, charge, and current distribu¬ 
tions, in the form of a snapshot at t = 0 of one wavelength of the TEM mode. 
(The wavelength A is 2n/k.) Figure 14.2(a) shows the electric field and the sur¬ 
face charge, and Fig. 14.2(b) shows the magnetic field and the surface current. 

Unlike TEM modes in an infinite vacuum, for which E can point in any direction 
orthogonal to the direction of propagation, the TEM mode in the bounded volume 
must have E pointing across the gap. 
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The TEM mode is rather featureless. It propagates with velocity c for any 
frequency. The wave fronts are planar cross sections of the gap, orthogonal to 
the direction of propagation, and the fields are independent of x and y. The other 
modes of propagation (TE and TM) are more interesting. 

14.1.2 ■ TE Waves 

What we mean by a TE wave is that E is transverse to the direction of propagation. 
But B will have components in both transverse and longitudinal directions. 

For a TE wave, E z = 0 in (14.1). Then Gauss’s law in the bounded volume is 


V E = dEy/dy = 0, 


so E y is constant. In the previous section we considered E y = Eq, a nonzero 
constant, and E x = 0, which led to the TEM mode. Now we take Eq = 0 and 
consider the orthogonal polarization with E x nonzero. That is, we seek solutions 
for which the electric field is polarized in the x direction, 

E(x, t) = i E x (y)e l(kz - Wt) . (14.7) 


Our first task is to determine E x (y ) and the relation between the frequency co and 
wave vector k. 

We can immediately determine the form of the magnetic field from Faraday’s 
Law, 9B/3 1 = — V x E. The frequency of oscillation of B must be the same as E, 
so 9/3 1 may be replaced by —ico. Therefore, evaluating V x E, 


B(x, t) = —V x E : 
co 


- \kE x (y ) , JE' x (y) 

J i 2 


co 


p i(kz-cut) 


(14.8) 


(The prime denotes the derivative with respect to y.) Note that Gauss’s law V B = 
0 is already satisfied by the field in (14.8)—a nontrivial result. 

One more Maxwell equation must be satisfied—the Ampere-Maxwell Law, 
3E/3 1 = c 2 V x B. Substituting the functions (14.7) and (14.8) this field equation 
implies 


or, 



Kiy) = - - * 2 ) E x (y). (14.9) 

This differential equation will be easy to solve, but the solution depends on the 
boundary conditions. The most general solution of (14.9) is 


E x (y ) = ci sin vy + C 2 cos vy. 


(14.10) 
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where c i and C 2 are constants, and cu 2 /c 2 = k 2 + v 2 . The boundary conditions 
place constraints on c\, ci, and v. 


Boundary Conditions 

The transverse components of E are continuous at any surface, so in particular at 
the boundary planes y = 0 and y = b. Because we are assuming perfectly con¬ 
ducting walls, the electric field is 0 in the material. Thus the boundary conditions 
for the function E x (y) in (14.7) are 

E x (0) = E x (b) =0. (14.11) 


The first condition implies that C 2 = 0 in (14.10); the second condition implies 
that sin vb = 0, so vb is an integer multiple of it. Therefore the solution to the 
differential equation (14.9) with boundary values (14.11) is 


E x (y) = Eq sin , 

where n is a positive integer, and 



(14.12) 


(14.13) 


Equation (14.13), or more specifically the function u> = co(k), is called the disper¬ 
sion relation for these waves. We conclude that the electric and magnetic fields of 
a TE solution are 


E(x, t) = i £ 0 sin e i{kz ~ m,) , 


B(x, t) 


~kEo fnny\ JmzEo /nny\ 
1 -sin I - 1 4- 7.-cos I -1 


J- Sin 

w 


V b 


-) 


+ z - 1 cos I - 

bco V 


b > 


(14.14) 
e i(kz-wt)' (J4.J5) 


We derived these solutions requiring that E(x, t) is translation invariant in x. In 
fact it can be shown 4 that all solutions that are harmonic in time, traveling in the z 
direction and polarized in the x direction, satisfy this requirement. Also, the real 
parts of the complex functions in (14.14) and (14.15) are the physical fields. 

Another boundary condition is that the normal component of B must be con¬ 
tinuous at y = 0 and y = b. That is, since B = 0 in the conducting walls, 


B y (0) = By(b) = 0 (14.16) 

using the notation of (14.2). These conditions are indeed satisfied by (14.15). 


Propagation Characteristics 

The solution (14.14) and (14.15) is called the TE(n) mode. Each field is atraveling 
wave in the z direction and a standing wave in the y direction. The electric field is 

4 See Exercise 3. 
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FIGURE 14.3 Snapshot of the TE(1) mode. The diagrams show schematically (a) the 
electric field, and (b) the magnetic field and current density, for a TE(I) wave between 
planar boundaries of perfect conductors, at an instant of time. The wave travels in the z 
direction, and the wavelength is X = 2k/ k. A dot indicates a vector pointing out of the 
page, and a cross, one into the page. The curves in (b) are tangent curves of the magnetic 
field. 


polarized in the x direction. The magnetic field has two components: a transverse 
component in the y direction, which oscillates in phase with the electric field, and 
a longitudinal component in the z direction, which oscillates with a 90 degree 
phase shift relative to the electric field. 

Figure 14.3 is a schematic diagram of a snapshot at t = 0 of the electric and 
magnetic fields for one wavelength of the TE(1) mode. (A quantitatively accurate 
plot is provided in Fig. 14.5.) In Fig. 14.3 the wavelength in the z direction is 
X = 2n/ k. The electric field alternates in the ±x direction, and the magnetic field 
curls around the changing electric field. The wave propagates in the z direction, so 
at a later time t the fields in Fig. 14.3 will be translated in the z direction by cot/k. 
What this diagram cannot illustrate is the. field magnitudes as functions of y and z. 
E x is maximum at the center and 0 at the boundaries; B y has the same pattern. 
B z is maximum at the boundaries and 0 at the center. 

The dispersion relation (14.13) determines the wave velocities. For electro¬ 
magnetic waves in an infinite vacuum the phase velocity and group velocity are 
both equal to c = 3 x 10 8 m/s. The TEM mode for Fig. 14.1 is the same. But the 
TE modes are much different, because the frequency u> is not a linear function 
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of the wave vector k ; the phase and group velocities differ, and depend on the 
frequency. 

The phase velocity is co/ k, because a constant phase point in the factor 
e'(kz-mt) moves i n the z direction with velocity Sz/St = co/k\ thus 

CO cco 

Vph= T = 7= (14.17) 

k y/a?- — ( mtc/b ) 2 

The phase velocity is always greater than c, the speed of light in free space. 5 The 
phase velocity approaches c in the limit of high frequencies, but it approaches 
ooasw decreases to the value mtc/b. Also, there is no propagation in the TE(n) 
mode for co < mtc/b , because for co < mtc/b the wave vector k determined from 
(14.13) would be imaginary; in this case the fields would decay exponentially as 
a function of z rather than propagating as a wave. The parameter co n = mtc/b 
is called the cutoff frequency of the mode TE(n). The absolute cutoff frequency 
of TE waves, below which no TE wave can propagate between the planes, is the 
cutoff frequency co\ of the TE(1) mode, 


(^cutoff — , ■ (14.18) 

b 

(The TEM mode propagates for any co, all the way down to 0.) 

The group velocity of TE waves in the space between the conducting planes is 

dco c 2 k / " ' 

Vor = -jr- = - = cJ\ - (mtc/cob) 2 . (14.19) 

& dk co v 

(Note from (14.13) thslcodco = c 2 kdk.) The group velocity is the signal velocity, 
and also the velocity of energy transport. It is less than c for any frequency, and 
it approaches 0 as co approaches the cutoff frequency mtc/b. Figure 14.4 shows 
plots of u p h and Ugr versus co. 

The Surface Current 

The longitudinal (z) component of B(x, t) is discontinuous at the conducting sur¬ 
faces. The discontinuity of this tangential component implies the existence of 
surface currents in the conducting planes. Recall the boundary condition on B t , 

B t2 - B t i = Mon x K, (14.20) 

where K(x, t) is the surface current density and n is the unit normal vector point¬ 
ing from 2 —r 1, i.e., from the conductor into the space between. For the case 
of a perfect conductor, we set B ,2 = 0 in (14.20) and operate with nx on both 
sides. The result is K = n x B,i /mo, a convenient form for calculating the surface 
currents in this chapter. The normal vector n is + j at y = 0 and — j at y = b; the 

5 At this point it is natural to ask whether the result Upp > c conflicts with the theory of relativity. It 
does not. We shall explore this question later. 
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v/c 



FIGURE 14.4 Phase and group velocities, for TE(rc) or TM(n) modes of electromag¬ 
netic waves propagating between parallel planes, as functions of frequency <w. The cutoff 
frequency is a> n = nnc/b. For co < u> n the modes do not propagate. 


direction of B is ±z at the inner surfaces, the sign depending on z and t. Therefore 
surface currents flow in the ± i direction. Because B is 0 in the perfect conductor, 
the discontinuity of B is just the value of B at the surface in the interior. Thus the 
surface current densities are given by 


K = + 


K = 


B z (0 ) ? 
Mo 

B z (b) ~ 

MO 


nJlEo ■ n 

sin (kz — cot) 


= (-D 


Mo bco 
„ nnEo . 


Hobco 


at y = 0, 
sin (kz — cot) i at y = b. (14.21) 


(The imaginary parts have been dropped.) In other words, current waves propa¬ 
gate on the boundary surfaces along with the field wave in the space between the 
planes. The existence of a longitudinal component of B is not surprising once we 
see that there are surface currents, because B must curl around K. Figure 14.3(b) 
shows the surface currents for a snapshot of one wavelength of the mode TE(1). 


Energy Transport 

The final part of the analysis is to find the energy transport of the wave. The energy 
flux is the Poynting vector S = E x B/mo, which has both z and y components. 
To calculate S we must use the physical fields—the real part of the complex wave. 
It is straightforward to show from the fields (14.14) and (14.15) that the Poynting 
vector for the mode TE(n) is 

S(x, t) = —[zsin 2 cos 2 (kz — cot) (14.22) 

Mow L V b / 

? nrr . /nrry\ /nrry\ 1 

+ j — sin cos s * n (kz ~ tot) cos (kz — cot )J . 
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The y component of S, which describes energy transport in the direction from 
one plane to the other, is proportional to sin (kz — cot) cos (kz — cot). This function 
oscillates about 0, and averages to 0 over one period of oscillation, so it corre¬ 
sponds to energy transfer back and forth across the gap. In contrast, there is a net 
transport of energy in the z direction, i.e., the direction of wave propagation. The 
average power per unit area is 


=- r 

2rr Jo 


S(x, t) dt = 




(14.23) 


Integrating S avg over y yields the power per unit of length in the x direction carried 
by the electromagnetic wave 


-J- = [ s avg • idy - ~~J 1 - ( mrc/cob) 2 . 
dx Jo 6 4 fi 0 c v 


(14.24) 


The power carried by the wave is proportional to the group velocity, 6 so it tends to 
0 as co decreases to the cutoff frequency nnc/b. This result shows that the group 
velocity is the velocity of energy transport. 

Figure 14.5 is a quantitatively accurate diagram of the energy flow for the 
TE(1) mode, in the form of a snapshot at t = 0 for one wavelength of the 



FIGURE 14.5 Snapshot of the energy flux of the TE(1) mode in the space between 
parallel planar boundaries of perfect conductors. The wavelength X is equal to the sepa¬ 
ration between the planes b in this figure. The solid curves are everywhere tangent to the 
Poynting vectors, and the dashed curves are tangent to the magnetic field vectors. 

6 See Exercise 1. 
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mode. The solid curves are the tangent curves of the field of Poynting vectors. 
The magnetic field is also illustrated; the dashed curves are the tangent curves of 
the magnetic field vectors. (Note that S is everywhere orthogonal to B.) This plot 
is for the special case X = b. 

Phase Velocity Greater Than c 

It is sometimes stated, incorrectly, that “nothing can travel faster than c, the speed 
of light in vacuum.” We have just seen an example of something that does travel 
faster than c: the constant phase point of a guided wave. It is true that no massive 
particle can travel faster than c, but that’s different. Also, no signal carrying infor¬ 
mation can travel faster than c, but that’s also different. The constant phase point 
is not a carrier of information. 

To pursue this idea a little further, consider a pulse of electromagnetic radia¬ 
tion. The pulse carries information, because it might be, for example, part of a 
message in Morse code; or it might be a signal sent at some specific time. By 
Fourier analysis, the pulse may be considered to be composed of infinite trains of 
waves with a broad spectrum of frequencies. If the pulse is emitted into free space 
then all the component waves travel at speed c (the phase velocity) so that the 
pulse also travels at speed c (the group velocity) and arrives at its destination with 
the same shape with which it was emitted. In this case the information associated 
with the pulse has traveled with speed c. But suppose instead that the pulse is 
emitted into a parallel-plane transmission line such as we have been considering. 
Then the pulse will change as it travels, because of dispersion. Low frequencies 
cannot propagate in the bounded region. Modes with frequency above the cut-off 
propagate in the waveguide, but each mode travels with a different phase velocity 
that depends on frequency. All the phase velocities are greater than c, as we have 
seen. However, no information is carried by the individual infinite wave trains. In¬ 
formation can only be transmitted by some identifiable feature of the pulse, such 
as the point of maximum amplitude. Such features travel with the group velocity, 
which is less than c? 

There is no conflict between the theory of relativity and the fact that the con¬ 
stant phase point travels faster than c. Figure 14.6 shows an analogous situation— 
ocean waves breaking on a beach. The waves travel at speed v and approach the 
shore at angle a. The point on the wave where the wave encounters the shore 
travels down the beach with speed v/ cos a, which is greater than the speed of 
the wave in open water. This example is analogous to the motion of the constant 
phase point of a TE wave between parallel planes; in both cases a certain phase 
point travels faster than the free wave speed. 

The TE Wave as a Sequence of Reflections 

We derived the TE wave fields (14.14) and (14.15) by a straightforward mathe¬ 
matical approach—solving the boundary value problem for Maxwell’s equations 
in the region between the planes. It is also interesting to understand the solution 
in another way. The electric field (14.14) is a traveling wave in z and a standing 

7 Good discussions of group and phase velocities can be found in Refs. [3] and [4], 
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FIGURE 14.6 Waves on a beach. The wave speed is v , the distance between crests is A, 
and the waves approach the beach at an angle a. In the time it takes for the wave to travel 
a distance A, the point where the wave breaks travels distance A/cos a to the left along the 
beach. Therefore the point where a wave encounters the shore travels down the beach at 
speed v/cosa. 


wave in y. But a standing wave is a superposition of traveling waves in opposite 
directions. Therefore (14.14) is a superposition of two waves with wave vectors 
k + and k_ given by 

» nir » 

k±=*z±— j; (14.25) 

b 

specifically, 

E(x, t) = E + e , ' (k +* x - firf) + E_e i(k - ,x “"'\ (14.26) 

where E± = ± i Eq/H. In any small section of the wave guide the two terms in 
(14.26) may be interpreted as an incident wave plus a reflected wave. The wave 
vectors k + and k_ have equal components along z and equal but opposite compo¬ 
nents along j. This relationship between k + and k_ describes specular reflection 
from the planes y — 0 or y = b. 

The individual terms in (14.26) have the same properties as waves in free space. 
The dispersion relation (14.13) implies that 

= c|k+| and o> = c|k_|. (14.27) 

The vector amplitudes E + and E_ are orthogonal to the respective wave vectors 
k + and k_. We may also write the magnetic field as a superposition of waves with 
the wave vectors k±, as 

B(x, t) = B + e i(k +* x -^ ) + B_e , ' (k -' , -® ,) . (14.28) 

By then applying V x E = —3B/3 1 we obtain for the vector amplitudes 



(14.29) 
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The amplitudes B + and B_ are orthogonal to E + and E_, and also orthogonal 
to the respective wave vectors k + and k_. Furthermore, the ratios |B + |/|E+1 and 
|B_|/|E_| are both 1/c. So in all respects the separate electromagnetic waves 
with wave vectors k + and k_ are like waves in free space. 

By writing the guided wave fields as the superposition in (14.26) and (14.28) 
we justify the heuristic picture of an electromagnetic wave reflecting back and 
forth between the planes, as illustrated in Fig. 14.7. However, the “ray” picture in 
Fig. 14.7 does not do complete justice to the phenomenon of wave motion in a 
wave guide, because it does not illustrate the interference between the superposed 
waves. Nevertheless, a nice point of this development is that it explains why the 
phase velocity of the guided wave is greater than c. Either of the superposed 
waves in (14.26) has phase speed c, but the phase velocities are in the directions 
of k + or k_ rather than the direction z. The superposition is a traveling wave in 
the z direction with phase speed c/ cos a, where a is the angle between k + or k_ 
andz. 8 

Figure 14.5 shows a snapshot of the energy flow for the pure TE(1) mode, for 
the case X = b. The energy flows back and forth between the planes, with the 
average flux in the z direction. This plot is consistent with the heuristic picture of 
the wave motion as a sequence of reflections. 

Equation (14.22) shows that the ratio of the component of energy flux across 
the gap S y , to the component in the direction of propagation S z , is proportional 
to X/b, where X = In / k is the wavelength parallel to the planes. If the wavelength 
is much shorter than b then the energy flow is strongly directed parallel to the 
planes; this is the case of high frequencies. In contrast, for long wavelengths there 
is strong energy flow back and forth across the gap. The cutoff frequency has 
X = oo, and in this limit all the energy flow is perpendicular to the bounding 
planes. 


8 See Exercise 2. 
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14.1.3 ■ TM Waves 


For the TEM mode, E is normal to the conducting planes and B is tangential; see 
Fig. 14.2. For the TE modes E is tangential and B has both normal and tangential 
components; see Fig. 14.3. For the TM modes B is again purely tangential, as well 
as transverse to the direction of propagation. So, in this case the solutions take the 
form 

B(x, t) = i B x (y)e ,(kz -° Jt) . (14.30) 


However, unlike the TEM mode, B varies with y. 

Gauss’s Law (V • B = 0) is satisfied by (14.30) for any B x (y). In a region of 
vacuum B must also obey the wave equation V 2 B—c _2 3 2 B/3t 2 = 0. Substituting 
(14.30) into the wave equation we find that B x (y) must satisfy 

B"(y) = -Y 2 B x (y) (14.31) 

where y 2 = co 2 /c 2 — k 2 . (Again, the prime indicates differentiation with respect 
to y.) Therefore B x (y) is a linear combination of cosyy and sinyy. Another 
restriction on B is the boundary condition that the normal component must be 0 
at the planes y = 0 and y = b\ but this is always true for (14.30) because B is 
parallel to the planes. 

The further constraints on B x (y) come from the boundary condition on the 
tangential component of the electric field. The form of E is determined by the 
Ampere-Maxwell Law 


V x B = 


1 3E 

c 2 3 1 



E, 


which implies 


E(x, 0 = j kB x (y) - iiB x (y)j 


C fiikz-cot) 
CO 


(14.32) 


The tangential component E : must be zero at the boundary surfaces, so 


B' x (0) = B' x (b) = 0. 


(14.33) 


The solution of the differential equation (14.31) and boundary conditions (14.33) 
is 

B x (y) = Bq cos > (1434) 

where n is an integer and y = mx/b. The solution with n = 0 is precisely the 

TEM mode derived earlier. The TM modes have n = 1,2, 3,_ 

Summarizing, the electric and magnetic fields of the TM(n) mode are 


B(x, t) = 

- /nit y\ 

iS » cos (-r) 

£ i(kz-<ot) 



(14.35) 

E(x, t) = 

? /nny ' 

-j cos I —— 

\ „ ink 

+ sin 1 


° 2kB 0 c i(kz-wt) 

(14.36) 

V b / 

> 2b 

\ b > J 

CO 
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FIGURE 14.8 Snapshot of the TM(1) mode, f or propagation between parallel conduct¬ 
ing planes. The diagram shows schematically (a) the electric field and charge density, and 
(b) the magnetic field and current density. The charges and currents reside on the bounding 
surfaces at y = 0 and y = b. 


The fields of the TM(1) mode are shown schematically in Fig. 14.8. This figure is 
a snapshot at t = 0 of one wavelength (A = 2ir/k) of the wave. As forTE waves, 
the TM waves may be regarded as a sequence of reflections. 

The dispersion relation for the TM(n) mode is o? = c 2 k 2 + ( turc/b ) 2 , which 
is just the same as for the TE(n) mode. Therefore the phase and group velocities 
are also the same, as shown in Fig. 14.4. 

From Fig. 14.8(b) we see that B t is discontinuous at the boundary planes. 
Therefore there are surface currents with density K given, for the TM(1) mode, by 


MoK = +j x B = —zBocos(kz — cot) at y = 0, 

MoK = — j x B = —zBocos(kz — cot ) at y = b\ 

these surface currents are shown in Fig. 14.8(b). From Fig. 14.8(a) we see that E n 
is discontinuous at the boundary planes. Therefore there are surface charges with 
density a given, for the TM(1) mode, by 
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tr/eo = + j E = - 


c 2 kBo 


cos (kz — cot) at y = 0, 


fj/e 0 = -j E = - 


c 2 kB o 


cos (kz — cot) at y = b; 


these surface charges are shown at t — 0 in Fig. 14.8(a). A wave of charge and 
current propagates on the surfaces along with the electromagnetic fields in the 
interior. 

The Poynting vector for the TM(n) mode is 


c 2 kBx r„ , /nny\ 

S(x, t) =- zcos z (-I cos Z (kz — cot) 

unco L \ b / 


(14.37) 


~n\ / nTty\ . / mty \ „ s . ,, 

— j — cos \ ) sm \ “J cos (kz - cot) sin(xz — cot) . 

Figure 14.9 shows a snapshot at t — 0 of the energy flux for one wavelength of 
the mode TM(1) forparameters with X — b. The average energy flux is 


c 2 kB2 ~ 

Savg — COS 

IfJiQCO 




(14.38) 


y/b 



FIGURE 14.9 Energy flux of a TM(1) wave in the space between planar boundaries 
of perfect conductors. The wavelength X is equal to the separation between the planes b in 
this figure. The solid curves are everywhere tangent to the Poynting vectors, and the dashed 
curves are tangent to the electric field vectors. 
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This power density is similar to that of the TE(n) mode (14.23), but there is this 
difference: In a TE mode the power density is 0 at the boundaries whereas in a 
TM mode it is maximum there. 

14.1.4 ■ Summary 

The example in this section is very idealized; the boundary planes are infinite and 
we have only considered waves that are coherently uniform in the x direction. 
Though unrealistic, this example has the value that it is simple enough to analyze 
in detail, and yet it demonstrates some important ways that propagation of electro¬ 
magnetic waves in a space bounded by conductors is different from propagation 
in free space. The different properties of the TE and TM waves in the bounded 
space are these: 

■ There exist discrete modes of propagation, and a given mode is only possible if 
the frequency is higher than a certain minimum value—the cutoff frequency— 
associated with that mode. For a given frequency the number of modes is finite. 

■ Either B or E may have a nonzero longitudinal component. 

■ The dispersion relation is nonlinear, so the phase and group velocities differ and 
depend on the frequency. 

■ The Poynting vector is not always parallel to the direction of propagation, al¬ 
though S avg is. 

■ There are nonzero charge and current densities on the boundary surfaces. 

The same properties also hold for electromagnetic waves in a rectangular wave 
guide, a device with practical applications in microwave technology, to which we 
tum now. 


14.2 ■ THE RECTANGULAR WAVE GUIDE 

We saw n the previous section that electromagnetic waves in a space bounded 
by conductors have particular modes of propagation, each mode having a char¬ 
acteristic vector functional form and specific phase and group velocities. In the 
present section we shall determine the complete set of harmonic modes in a wave 
guide with a rectangular cross section. There are two classes of guided waves— 
TE and TM. There is no TEM mode in a hollow rectangular wave guide. 

We assume the wave guide is much longer in one dimension, which we take to 
be the z direction, than in the other two; then it is a good approximation to imagine 
the guide infinitely long in the z direction. Figure 14.10 shows the bounded space 
in which the electromagnetic waves travel. The cross section has dimensions axb, 
where a is the length in the x direction and b that in the y direction. In the regions 
x < 0 or x > a, and y < 0 or y > b, there is a perfect conductor—a material 
with electric conductivity a = oo. This means that the electric and magnetic 
fields are 0 in these exterior regions. We will assume that the interior of the wave 
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FIGURE 14.10 Geometry of a rectangular wave guide. The exterior is a conductor, 
and the interior is a dielectric. Waves propagate in the z direction. The cross section is a 
rectangle of dimensions a x b. 


guide is vacuum, but air or some other weak dielectric would be essentially the 
same. 9 The boundaries of the wave guide are four planar strips: 

* = 0 or x = a and 0 < y < b, 

y = 0 or y = b and 0 < x < a. (14.39) 

On these surfaces B n and E t must be 0, by the continuity of these field compo¬ 
nents. The boundary conditions play a major role in determining the modes of 
propagation. 


14.2.1 ■ Transverse Electric Modes TE(m, n) 

First we consider electromagnetic waves propagating down the guide in the +z 
direction with E(x, t) orthogonal to the direction of propagation—the transverse 
electric modes. The electric field has the form 

E(x, t) = [i E x (x,y) + j E y (x,y)]e m -°*\ (14.40) 

The various field equations and boundary conditions will determine the functions 
E x (x, y) and E y (x, y). First, Gauss’s Law V • E = 0 implies 


dE x dE y 

1 = 0 . 

dx dy 


(14.41) 


It is convenient to satisfy this equation by introducing a scalar function t/r(x, y) 
such that 


3t lr 3 i/f 

E x =-and E y = +——; 

3y y dx 


obviously then (14.41) is satisfied. In other words, we write 
E = V x (-ife i(kz ~ a,t) } ; 


(14.42) 


(14.43) 


that V • E is 0 follows from the general identity that the divergence of a curl is 0. 
9 Ifthe interior is a dielectric with permittivity e then replace f q in the equations by e, and c by c^/cq/c. 
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The electric field in a region of vacuum also satisfies the wave equation, 


2 1 3 2 E 


where c = l/^/eoMO- Substituting (14.40) and (14.42) into the wave equation we 
find that the equation is satisfied if 


vV — k 2 ijf H—= 0; 
c z 


(14.44) 


that is, 

2 

V 2 i /r = -v 2 i/r where y 2 = ^--fc 2 . (14.45) 

c z 

Equation (14.45) is the Helmholtz equation in two dimensions (2D). We can solve 
this partial differential equation (p.d.e.) by the familiar technique of separation of 
variables. Write i/{x, y) = f(x)g(y). Then the p.d.e. in (14.45) becomes, after 
dividing both sides of the equation by i jr. 


L^L + lfi 

f dx 2 g dy 2 


(14.46) 


The two terms on the left-hand side depend independently on x and y, whereas 
the right-hand side is a constant. Therefore each term on the left-hand side will be 
a constant (call them —/x 2 and — v 2 respectively) so 


d 2 f 2 d 2 g 2 

^r = V/ and =-v 2 g, (14.47) 

and y 2 = n 2 + v 2 . The general solutions of (14.47) are 

f(x ) = cj cos tix -I- C 2 sin /ix (14.48) 

g(y) = C 3 cos vy + C 4 sin vy; (14.49) 


the constants c\ ... C 4 and parameters /1 and v must be determined from the 
boundary conditions. 


Boundary Conditions 

The tangential components of E must be 0 at the four boundaries listed in (14.39). 
First, E y must be 0 at x = 0 and x = a, for all y between 0 and b. Here E y oc 
d\ji/dx = f'(x)g(y) (where the prime means the derivative) so the boundary 
conditions on the walls at .v = 0 and x = a are 

f ( 0 ) = 0 which implies C 2 = 0 , 
f'(a) = 0 which implies sin/ua= 0 . 
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Similarly E x must be 0 at y = 0 and y = b. In this case E x oc —3i ls/dy = 
—f(x)g'(y) so the boundary conditions on the walls at y = 0 and y = b are 

g' ( 0 ) = 0 which implies C 4 = 0 , 
g\b) = 0 which implies sinv£ = 0 . 


The boundary conditions restrict the possible values of /x and v; both \ui and vb 
must be an integer multiple of n, 

mn , tin 

I_l = - and v = —, (14.50) 

a b 

where m and n are non-negative integers. The parameter y in (14.45) is therefore 



(14.51) 


Finally then the function \^(x,y) is 

VK-G y) = ^ocos (—^—) cos ) . (14.52) 

The constant rFo determines the wave intensity. The corresponding electric field 
in the guide is, according to (14.40) and (14.42), 


E(x, t ) 


T cos (—) sin (T-) - j V sin (—) cos (—)] ^ 


(kz—wt) 


(14.53) 


The electric field is the product of a traveling wave in z and standing waves in 
x and y. In other words, the boundary condition E t = 0 requires that an integer 
number (m) of half waves must fit between the walls at .c = 0 and a, and similarly 
(n) between the walls at y = 0 and b. In general E(x, t) has both components E x 
and E y transverse to the direction of wave motion; but for the special cases with 
m or n equal to 0 the electric field has just one of these components. The mode of 
propagation with electric field (14.53) is called the TE(m, n) mode. 


The Magnetic Field of the TE(m, n) Mode 

Given the electric field it is now straightforward to derive the magnetic field from 
Faraday’s Law, V x E = —3B/3r = ico B, i.e., 


B = - 


i 


i 1 z 

d/dx 3/3 y 3/3z 
E x Ey 0 


co 


(14.54) 
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After a short calculation using (14.40) and (14.42) we find 

,2 


k iy‘ „ 

-Vt/r + —ix/f 

CO CD 


B(x, t) = 

Or, substituting the solution (14.52) for \f(x, y) 
B(x, t 


i(kz—tot) 


■ _ cin 1 

/mnx\ 

| COS I 

(nny \ 

1 - Mil | 

a 

\ a / 

l b ) 


- nrt 

+ j —- cos 
b 


z—— cos 
k 


(m7tx\ 


'mxy\ 

V a > 

) sin ( 

) 

(mnx\ 

1 pnc i 

(nny\ 

\ a > 

1 cUa 1 

V b ) 


(14.55) 


— e i{kl ~ wt \ (14.56) 


The magnetic field of a TE mode is not transverse. The ratio of the longitudinal 
component B z to a transverse component B x or B y is proportional to 1 /k = 
A/2 it, where A is the wavelength of the propagating wave. So, the longitudinal 
component is small for short wavelengths but large for long wavelengths. 


Propagation Characteristics 

The dispersion relation for the TE(m, n ) wave is found from (14.45) and (14.51), 


co 2 = c 2 k 2 + 


/mites. 2 / nnc\ 


) M' 


b ’ 


(14.57) 


Because the dispersion relation is nonlinear, the phase and group velocities are 
different. The phase velocity is 


Uph 


co 

I 


cco 




(14.58) 


Note that the phase velocity is always greater than c and that it diverges as co 
decreases to the cutoff frequency co mn defined by 


COmn 



(14.59) 


For co < co mn there is no propagation in the TE(m, n) mode, because then the 
dispersion relation (14.57) would require k to be imaginary. The group velocity 
of the TE(m, n) wave is 


v gr 


dco 

dk 


c 1 k 

co 


= c V /l 


(14.60) 


The group velocity is always less than c, and it approaches 0 as co decreases to 
the cutoff co mn . For high frequencies co co mn both the phase and group velocity 
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approach c. An interesting relation is that 

fphfgr = . (14.61) 

A plot of the phase and group velocities as functions of frequency co would look 
exactly like the earlier case of TE or TM waves between parallel conducting 
planes, shown in Fig. 14.4. 

For any given value of the angular frequency co there are only a finite number 
of propagating TE modes. The mode numbers m and n of a propagating wave are 
limited by the inequality 

co mn < <w. (14.62) 

Discreteness of wave states in a bounded volume is a common feature of waves 
of all kinds. The mode with lowest cutoff frequency is ( m , n) = (1, 0) if a > b, 
or ( m , n ) = (0, 1) if b > a. So, for example, if a > b there can be no wave 
propagation through the wave guide with frequency co < a> cuto ff, where 

JVC 

^cutoff = <t>io = —• (14.63) 

a 

If a > b then the next lowestTE mode is either ( m , n) = (2, 0) or (1, 1), depend¬ 
ing on the relative size of a and b. If the frequency of oscillation co is between co\q 
and the next cutoff frequency then only the mode TE(1,0) can propagate down 
the wave guide. It is common in microwave applications to choose dimensions of 
the wave guide for a given frequency of excitation such that only the mode with 
lowest cutoff frequency can propagate. Any higher modes that might be excited 
by some irregularity will decay exponentially to negligible levels before reaching 
the end of the guide. 

Why do electromagnetic waves with co < a> cuto ff decay when they enter the 
wave guide? (By the dispersion relation (14.57) k is imaginary if co < co mn , so 
then e lkz decreases exponentially with z.) There is no dissipation in this system 
because the conductivity of the walls is oo, so the attenuation is not due to dissi¬ 
pation. Rather, the charge and current densities induced in the conducting walls, 
which we will calculate presently, shield the interior of the wave guide from fields 
outside the guide if the frequency is low. For the special case co = 0, i.e., electro¬ 
statics, this result is the familiar Faraday cage phenomenon. But we have found 
more generally that for co < co mn there is exponential attenuation with distance in 
the wave guide. 

If a radiating antenna is placed in a wave guide, but the frequency is below 
the cutoff, then the energy supplied to the antenna is not converted to propagation 
of electromagnetic waves. Oscillating fields occur in the neighborhood of the an¬ 
tenna, and these create oscillating charge and current densities on the conducting 
walls. But the field and source oscillations are not in phase, and each opposes 
wave propagation (in z) of the other. If co is large then the displacement current 
dominates the charge current and the wave propagates. If co < a> cuto ff then there is 
destructive interference, increasing with z, between the fields of the antenna and 
those of the wall currents, and the wave decays. 



546 


Chapter 14 Wave Guides and Transmission Lines 


The fact that low frequency electromagnetic waves do not penetrate into a re¬ 
gion bounded by conductors explains why an AM radio signal cannot be picked 
up by a car radio inside a metal bridge or tunnel. 10 


EXAMPLE 1 Consider a wave guide with rectangular cross section of dimensions 
10 cm x 5 cm. Which TE modes can propagate for angular frequency An GHz? 
What are the 10 lowest cutoff frequencies for TE modes, and their corresponding 
mode numbers? 

In this example, a = 0.10 m and b = 0.05 m. The cutoff frequency of the mode 
TE(m, n) is 


= (3n GHz) y/m 2 + An 2 . 


(14.64) 


a>/(3n GHz) 


0 + 


(1,2) 

(4, 0) (0, 2) 
(3, 1) 

(3, 0) 

( 2 , 1 ) 

( 1 , 1 ) 

(2, 0) (0, 1) 
( 1 , 0 ) 


FIGURE 14.11 Cutoff frequencies of the TE modes for the rectangular wave guide in 
Example 1. 


The mode TE(m, n) propagates in the guide only if w mn < u>. The only TE mode 
that can propagate with angular frequency An GHz is the TE(1,0) mode. Figure 
14.11 shows the cutoff frequencies of the 10 lowest TE modes, which are, in 
order: ( mn ) = (10), (20) and (01) (degenerate), (11), (21), (30), (31), (40) and 
(02) (degenerate), (12). 


to 


'See Exercise 6. 
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14.2.2 ■ Transverse Magnetic Modes TM(m, n) 

In the previous section we found a set of modes in a rectangular wave guide for 
which the electric field is transverse, i.e., orthogonal to the axis of the guide. These 
modes are complete for transverse electric waves: Any TE wave can be expressed 
as a superposition of the TE modes. But there is another class of waves, for which 
the magnetic field is transverse. In this section we describe the TM modes. 

Following the same line of analysis that we used to construct the TE modes, 
now we write the magnetic field as 

B(x,r) = (-5^ + i 04.65) 

where again \Js(x,y) satisfies the 2D Helmholtz equation (14.45). This form guar¬ 
antees that V ■ B = 0, and that B satisfies the wave equation. The associated 
electric field is deduced from the Ampere-Maxwell Law, 


l (p- _ c^k 

E(m) = -VxB= — 
co co 


iv 

-Vl/r + — Zl/r 


J(kz-cot) 


(14.66) 


All that remains is to impose the boundary conditions. 

It is sufficient to consider the boundary conditions on the normal component 
of B. At the boundaries listed in (14.39) B n must be continuous, which is to say 
0 because B is zero in the perfectly conducting walls. In terms of the function 
i Js(x, y) the boundary conditions are 


d\jf 

3 y 

= 0 

at 

x = 0 

and x = a 

d\j; 

dx 

= 0 

at 

o 

II 

and y = b 


A complete set of solutions to (14.45) subject to these boundary conditions con¬ 
sists of 


y) = 4/0 sin sin , (14.67) 

where m and n are both positive integers. This mode of propagation is called the 
TM(m, n) mode, and its fields are 

r -tot . /mnx\ /nnyx 

B<x ' ,) = r i T sm (~) cos (~r) 

+ j ^ cos sin (^)] (14,68) 


and 
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E(x, t) 


r? mn / 

1 I _ pnc | 

mnx\ 

cin 1 

ntry\ 

l -tua I 

L ci V 

a ) 

Mil 1 

b ) 

- nn . 

/mnx 

1 me 

(tmy\ 

+ j T sin 

V a 

1 LUa 

V b ) 

Jy 2 ■ 

_ 7 cin 

/mnx 

1 cin 

( n tty \ 

Mj Mil 

k 

\ a 

1 Mil 

V b ) 


^*0 joa-utl 

CO 


(14.69) 


The dispersion relation for the TM(m, n) mode is the same as for the TE(m, n) 
mode, given in (14.57). However, there is a major difference between the TM and 
TE modes: The TM mode with lowest cutoff frequency has ( mn) = (11) because 
there is no TM mode with either m or n equal to 0, while the lowest TE mode is 
either (m, n ) = (1, 0) or (0, 1). 


EXAMPLE 2 What are the surface charge densities f or the mode TM( 1,1)? 

The normal component of E on a conducting boundary surface is a/e o, where 
a is the surface charge density. On the horizontal surfaces, i.e., those parallel to 
the xz plane, the normal unit vector is n = + j for y = 0 and n = — j for y = b\ 
at those boundaries 


eo7rc 2 £4'o 

bco 


sin ^^ cos (kz — cot) (horizontal surfaces). (14.70) 


On the vertical surfaces, i.e., those parallel to the yz plane, n = + i for x = 0 and 
n = — i for x = a; at those boundaries 


a = 


eorrc 2 &'I , o 


aco 


sin cos (kz — cot) (vertical surfaces). (14.71) 



FIGURE 14.12 Surface charge distributions for the transverse magnetic mode 
TM(1, 1) of a rectangular wave guide. 


A snapshot of the surface charge distributions for one wavelength of the TM( 1,1) 
mode is shown in Fig. 14.12. This charge distribution propagates as a wave along 
the surfaces in the +z direction. One can easily figure out the surface current 
densities from the relation K = <7v p hZ. 
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14.3 ■ WAVE GUIDE OF ARBITRARY SHAPE 

The rectangular wave guide is most common, but other shapes are possible. In 
this section we analyze harmonic electromagnetic waves in a wave guide with an 
arbitrary cross section. The wave guide is infinitely long in the z direction, and the 
shape of the cross section is independent of z. The boundary surface is a cylinder 
parallel to the z axis. A cross section of the interior region is bounded by a closed 
curve C parallel to the xy plane. For simplicity we assume ideal conditions: The 
exterior of the guide is a perfect conductor and the interior is vacuum. The basic 
equations for the rectangular wave guide can be generalized to an arbitrary cross 
section. 

TE modes. The fields for a TE wave propagating in the +z direction may be 
written in the form 

E(x, t) = -V x {zfe i{kz ~ m,) \ = - [V x (zi Jr)] e i(kz ~ a,,) (14.72) 

B(x, t) = - -Vt/r + — Zt/r e i(kz - m \ (14.73) 

on k 

where i (r(x, y) is a scalar function independent of z. These forms are the same 
as (14.43) and (14.55) used in the analysis of the rectangular wave guide. All 
four Maxwell equations are satisfied if i/r (x, y) is a solution of the 2D Helmholtz 
equation 

V 2 f = -y 2 i/ (14.74) 

with co 2 /c 2 = k 2 + y 2 . 11 All that remains is to impose the boundary conditions. 

The normal component of B must be 0 on C, the boundary curve of a cross 
section of the guide. What is this condition in terms of \jr(x, y)l By (14.73) the 
normal component of B at a point on the surface is proportional to n • Vi/r, where 
n is the unit normal vector at the point in the plane of C; thus 

n-Vi/r=0 on C. (14.75) 

The tangential components of E must also be 0 on C, but that requirement leads 
to the same condition (14.75). 

For the TE modes then, i (r(x,y) obeys the 2D Helmholtz equation in the region 
enclosed by C, with normal derivative 0 on the boundary. Equation (14.74) is an 
eigenvalue problem, with operator V 2 and eigenvalue — y 2 . That is, i/r (x, y) is the 
eigenfunction of the 2D Laplacian with the Neumann boundary condition (14.75). 
There are an infinite number of discrete eigenstates. If the angular frequency co of 
the field oscillations is greater than cy then the solution describes a propagating 
wave, because k is real in that case. But if cd is less than cy then the solution 

n 


See Exercise 10. 
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decays exponentially with z, and the fields in that case are called an evanescent 
wave. 


TM modes. The fields of the TM modes are 
B(x, t) = -V x 


c 2 k 

E(x, t) = — 

co 


X7 / l Y ~ I 
Vl/f- Z1 j/ 

k 


J(kz-cot) 


(14.76) 

(14.77) 


where again i/r(x, y) is a solution of (14.74). The tangential component of E must 
be 0 on the boundary curve C. In particular, the z component in (14.77) must be 0, 
so for TM modes 


f = 0 on C. (14.78) 

The same condition implies that the other tangential component of E, and the 
normal component of B, are also 0 on C; these components are both proportional 
to (z x n) - Vi [r, i.e., the tangential derivative of i J/, which is 0 because i/r is 
constant along C. 

The function i/r(x, y) for a TM mode is an eigenfunction of the Laplacian for 
the Dirichlet boundary condition (14.78) with eigenvalue — y 2 . This eigenvalue 
problem is mathematically equivalent to the transverse vibrations of a membrane. 
For the analogue membrane, i J/(x, y) is the displacement from equilibrium of 
the point on the membrane at (x, y); the boundary condition (14.78) is that the 
membrane is fixed along the curve C like a drumhead. As in the TE case there are 
an infinite number of discrete eigenstates. 

TEM modes. Under what circumstances does a TEM mode exist? Looking at 
B for the TE modes (14.73) or at E for the TM modes (14.77) we see that these 
fields are transverse if and only if y = 0. So the partial differential equation for a 
TEM mode is 


vV = 0, 


(14.79) 


i.e., Laplace’s equation. 

We can now prove the following theorem: There are no TEM modes in a hollow 
conducting wave guide of the kind we have been considering—one that has a 
single closed curve C as the boundary of the cross section. This theorem is a 
corollary of the uniqueness theorem for Laplace’s equation. The unique solution 
to (14.79) for either boundary condition (14.75) or (14.78) is i/r(x, y) = 0; that 
is, the fields are 0. 

On the other hand there can be TEM modes if there is more than one conductor 
in the system, because then Laplace’s equation and the boundary conditions can 
be satisfied with i/r ^ 0. We saw an example of this case in Sec. 14.1, TEM waves 
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between parallel conducting planes. There the boundary of a cross section of the 
propagation space consists of two infinite lines, y = 0 and y = b. The TEM 
solution (14.3) and (14.4) corresponds to the TE fields (14.72) and (14.73) with 
\js(x, y) = E().x and y = 0. 12 This \j/(x, y) obeys Laplace’s equation and the 
Neumann boundary condition (14.75) at y = 0 and b. The uniqueness theorem 
is circumvented by the fact that the region is unbounded in x. Another example 
with a TEM mode, in which the propagation space is finite, is the coaxial cable 
transmission line, to be discussed in the next section. 

Finally, recall that the dispersion relation for waves in the cylinder is at 2 = 
c 2 (k 2 + y 2 ). A TEM mode (y = 0) has at = ck, the same dispersion relation as 
electromagnetic waves in free space. Therefore any TEM mode propagates with 
wave speed c and without dispersion. 


14.4 ■ THE TEM MODE OF A COAXIAL CABLE 

A transmission line is used to carry a signal in the radio or microwave frequency 
range from a transmitter to a receiver, or to an antenna, or to some other device. 
In analyzing a transmission line one can either focus on the current and voltage 
signal, or the wave signal, but the two are intricately coupled. One of the most 
common transmission lines is the coaxial cable, familiar in computer networks 
and cable television. The cable consists of two concentric conducting cylinders 
separated by an isotropic dielectric insulator. We can either analyze the electro¬ 
magnetic wave in the dielectric, or the current and voltage on the conductors. 

The geometry is defined in Fig. 14.13. The axis of the cable is defined to be the 
z axis. So that end effects are negligible, the cable is taken to be infinitely long. 
The radius of the inner cylinder is a, and the inner radius of the outer cylinder is b. 
The regions r < a and r > b are conductors, which we’ll take to have infinite 
conductivity. The waves propagate in the region a < r < b, where we’ll say the 
dielectric permittivity is e. 



FIGURE 14.13 Coaxial cable. Electromagnetic waves propagate in the space between 
the cylinders, which is filled with a uniform dielectric. Charge and current waves move 
on the cylinder surfaces. 

12 Alternatively the TEM mode corresponds to the TM fields (14.77) and (14.76) with i k (x, y) = B^y 
and y = 0. 
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The coaxial cable has a TEM mode, and this mode of propagation is the most 
important for signal propagation in practical applications. The purpose of this sec¬ 
tion is to derive this solution of the field equations. It resembles closely the TEM 
mode between parallel conducting planes described earlier, but with the planes 
rolled up into concentric cylinders. We shall use this similarity to help construct 
the solution. To solve the field equations it is most natural to use cylindrical coor¬ 
dinates. 

By the cylindrical symmetry, the electric or magnetic field of a wave prop¬ 
agating in the z direction is a vector function of r and <p times e ‘ (kz ~ 0J '\ The 
field directions of the TEM mode are orthogonal to z, so E and B may have radial 
and azimuthal components. In analogy with the TEM mode of parallel conducting 
planes, the electric field points from one conductor to the other, i.e., in this case ra¬ 
dially. Surface charges on the cylinders will satisfy the boundary condition of the 
normal component of E. Also, B must be purely azimuthal, i.e., circling around 
the z axis, because from the general analysis in Sec. 14.3 B and E are orthogonal. 
Surface currents will satisfy the boundary condition of the tangential components 
of B. By the cylindrical symmetry, the field magnitudes must be independent of 
the azimuthal angle <p. Putting together all these symmetry considerations we are 
led to a solution of the form 

E(x, t) = r E r (r)e i(kz ~ w,) , (14.80) 

B(x, t) = j)B<p(r)e i ( kz - a>,) . (14.81) 

We may proceed in various ways, but perhaps the most straightforward is to re¬ 
turn to first principles by substituting (14.80) and (14.81) directly into Maxwell’s 
equations. 13 The divergence of E is 

V -E = (rE r ) = 0. (14.82) 

r dr 

The divergence of B is always 0 for any field of the form (14.81). From the diver¬ 
gence equations we leant that rE r is constant, 

E r (r) = (14.83) 

r 


The curl of E is 


V x E = 4>ikE r {r)e i(kz - ( ° ,) (14.84) 

which, by Faraday’s Law, must equal 

-3B /dt = ^i(wB^(r)« ,(fcz -" ,) ; (14.85) 


13 Alternatively we could use the general results of Sec. 14.3; see Exercise 12. 
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that is, 


B^(r) ■ 


kC 

cor 


(14.86) 


The curl of B is 


V x B = -r ikB <t> (r)e i{kz - (0,) (14.87) 

which, by the Ampere-Maxwell Law, must equal 

= -ri/J- 0 ecoE r (r)e ,{kz ~ ai,) ; (14.88) 

dt 


that is, 


B^r) = 


flQ60)C 

kr 


(14.89) 


Together the curl equations show that the dispersion relation for the TEM mode 
is the same as for electromagnetic waves in an unbounded dielectric, 


co = vk where v = 1 /v'Mot- (14.90) 

Also, the ratio |B|/|E| is 1 /v. Of course these results are in accord with the dis¬ 
cussion in Sec. 14.3 of the TEM solution for a wave guide with arbitrary shape. 

The potential difference V(z , t) between the inner and outer cylinders at posi¬ 
tion z along the cable is, by definition, 


V(z,t)=f E • ? dr. 


(14.91) 


Substituting the TEM electric field, we find 

V(z,t) = V 0 e iikz - co,) where V 0 = C\n(b/a). (14.92) 

In terms of the amplitude Vo of oscillation of the potential, the fields of the TEM 
mode are 


E(x, t) 


rV 0 i(kz-cot) 

rln(b/a) 


B(x, t) 


__ e i(kz-cvt) 

vr In (b/a) 


(14.93) 

(14.94) 


So far we have ignored the boundary conditions. Taking the cylinders to be 
perfect conductors, E and B are 0 in the conducting regions. Therefore B n and E t 
must vanish at the boundary surfaces. But these components of B and E are 0 ev¬ 
erywhere for (14.80) and (14.81), so these boundary conditions are automatically 
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satisfied. On the other hand, E n and B t are discontinuous at the boundaries. Their 
discontinuities determine the surface charge and current densities: 


a =eE a 


and 


1 „ 

K = —n x B t = 
Mo 


+ eV 0 J(kz-cot) 

a In (b/a) 

for 

r = a 

~ eV 0 J(kz-tol) 
b\n(b/a) 

for 

r = b, 

+iveV 0 Jlkz _ at) 
a In (b/a) 

for 

r = a 

-iveVo j (kz - Mt) 
bln(b/a) 

for 

r = b. 


(14.95) 


(14.96) 


Thus charge/current waves propagate on the surfaces of the coaxial cylinders. 
Figure 14.14(a) shows the electric field and charge distribution, and Fig. 14.14(b) 
shows the magnetic field and current distribution for the TEM mode of a coaxial 
cable. 


Current and Impedance 

The total currents in the inner and outer cylinders, calculated by integrating the 
surface current density K around the circumference, are equal but opposite. Let¬ 
ting I (z,t) be the total current at z on the inner cylinder, 

I(z,t)= f K ■ z ad<t> = 2 * V u € ^ -e i ( kz - (0,) . (14.97) 

J o In (b/a) 

At a fixed position z on the cable there is an alternating current in the ±z direction 
with angular frequency co. At the same position the potential difference between 
the cylinders is given by (14.92). The impedance Z of the coaxial cable is defined 
as the ratio of V (z, t) over I (z, t), so 

V(z,t) Ifi o In (b/a) 

Z = -j-Ll = / — —. (14.98) 

/ (z, t) ye 2k 

For example, if the polarizability of the dielectric material between the cylinders 
is negligible, so that e = eo, then the impedance is 

Z = (60 £2) In (b/a). (14.99) 

The coaxial cable also has TE and TM modes, which could be determined 
from the general considerations of Sec. 14.3. However, in common applications 
the cable is designed to carry only the TEM mode, so its dimensions are small 
enough that only the TEM mode is a propagating mode. 
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FIGURE 14.14 The TEM mode of a coaxial cable, (a) Electric field and charge densi¬ 
ties on the conducting surfaces, (b) Magnetic field and current densities. 


14.5 ■ CAVITY RESONANCE 

If a finite length L of wave guide is closed at both ends by reflecting walls, the 
result is an example of a resonant cavity. Suppose the cross section of the wave 
guide is bounded by a closed curve C. We already know solutions for an un¬ 
bounded wave guide—TE or TM waves traveling in the +z or — z direction. At an 
end wall the fields can be resolved into incident waves and reflected waves, which 
interfere with each other. For simplicity assume the ends are perfect reflectors. 
Then if the length L is an integer number of half wavelengths, standing waves 
will be set up along the z direction with nodes of some field at the walls. 

For example, considerthe TE modes. The fields for traveling waves are (14.72) 
and (14.73) 
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E(x, r) = - [V x (zi Jr)] e ±ikz e- iwl (14.100) 

(14.101) 

where r/r(x,y) is the eigenfunction of V 2 with eigenvalue —y 2 . Here E is orthog¬ 
onal to z so it is tangential at the end walls. To satisfy the boundary condition 
E t = 0 for perfectly conducting walls at z = 0 and z = L we need the standing 
wave that results from the superposition 

e + lkz — e -‘ kz = 2 i sin/:z. 


B(x, t) = 


CO 


iy 2 ^ 

TVr/r + -f-z* 
k 


(The factor 2 i can be ignored, because it can be absorbed into the amplitude.) 
This function vanishes at z = 0 for any k, and it vanishes at z = L provided kL 
is an integer multiple of n. The cavity fields are 


E(x, t) = [z x Vi/r] sin e ,a 


B(x, t) = 


ik 


co 


Vi/r cos kz -T —ity sin^z 


(14.102) 

(14.103) 


where N is a positive integer and k = Nn/L. The relation between co and k 
must be co 2 = c 2 (y 2 + k 2 ), the same as for traveling waves, because that relation 
came from Maxwell’s equations independent of the boundary conditions. So the 
frequency of this cavity mode is 


co N = yjc 2 y 2 + ( Nnc/L ) 2 . (14.104) 

There is a set of resonant frequencies u>\, ( 02 , C 03 ,... for each eigenstate f(x, y) 
of the 2D Laplacian. The resonant frequencies are all greater than the cutoff fre¬ 
quency cy for the corresponding wave guide mode. Associated with the fields 
(14.102) and (14.103) are charge and current distributions on the sides and ends 
of the cavity. In a manner consistent with all the field equations, charge oscillates 
back and forth over the surfaces, and E and B oscillate inside the cavity. 

A section of wave guide closed at both ends is just one form of resonant cavity, 
and other forms are possible. Resonant modes of oscillation exist for a cavity 
of any shape. Analytic methods can be used to calculate the modes for simple 
symmetric shapes, like a spherical cavity, but for a complex shape the equations 
must be solved by numerical methods. 

The importance of resonant cavities is that they are used in microwave gener¬ 
ators. Various devices have been developed to produce microwaves. Historically 
the first were vacuum tube resonators—the magnetron and klystron—containing 
electron beams. In a magnetron a bunched electron beam circling in a toroidal 
cavity with conducting walls excites a resonance with the electromagnetic field. 
The resonant frequency depends on the geometry of the cavity and the orbital 
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period of the electron bunches. A magnetron is the source of microwaves in a 
microwave oven. 

Today there are solid-state microwave generators, which convert dc power to 
microwave power. Think of a hand-held cellular phone: The power is provided 
by a battery, but the phone transmits information by microwaves. The common 
design of solid-state microwave generators consists of a diode or transistor os¬ 
cillator circuit that produces the oscillation, combined with a cavity resonator, 
which locks the frequency to the desired value. For radio wave frequencies the 
resonator could be a “lumped circuit,” i.e., made of separate capacitive and in¬ 
ductive components (the “lumps”). But for higher frequencies, in the microwave 
range, the resonator must be an integrated device like a cavity resonator, e.g., a 
shorted wave guide. 


FURTHER READING 

1. J. C. Slater and N. H. Frank, Electromagnetism (Dover, New York, 1969). Chapter XI 
is a brief but clear exposition of wave guides. 

2. M. A. Heald and J. B. Marion, Classical Electromagnetic Radiation, 3rd ed. (Saunders, 
Philadelphia, 1995). 

3. W. C. Elmore and M. A. Heald, Physics of Waves (Dover, New York, 1969). 

4. H. Georgi, The Physics of Waves (Prentice-Hall, Englewood Cliffs, NJ, 1993). 

5. Microwave electronics is a major industry, and some engineering textbooks are entirely 
devoted to this subject: R. S. Elliott, Guided Waves and Microwave Circuits (Prentice- 
Hall, Englewood Cliffs, NJ, 1993). 


EXERCISES 

Sec. 14.1. Waves Between Conducting Planes 

14.1. For the TE(n) mode of propagation in the space between parallel conducting planes, 
show that S a vg = MavgUgr*, where u is the field energy density. Here the subscript 
avg implies an average over both t and y. Explain why this result means that the 
group velocity is the velocity of energy transport. 

14.2. Show that the superposition of waves in (14.26) has phase speed u p h = c/ cos a 
with phase velocity in the direction of z. (a is the angle between k+ or k_ and z.) 

14.3. Consider waves between parallel conducting planes, traveling in the z direction and 
polarized in the x direction. Show that E(x, t) must be independent of x. 

14.4. For electromagnetic waves between parallel planar boundaries of perfect conduc¬ 
tors, show that as co approaches the cutoff frequency of a TE or TM mode the 
wavelength for that mode approaches oo. 

14.5. Verify that the field (14.26) is the same as (14.14). 
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Sec. 14.2. Rectangular Wave Guide 

14 . 6 . Consider a car entering a tunnel of dimensions 15 m wide and 4 m high. Assuming 
the walls are good conductors, can AM radio waves propagate in the tunnel? 

14 . 7 . Suppose the dielectric in a rectangular wave guide undergoes breakdown (sparking) 
at 10 kV/m. Calculate the maximum power that can be carried by the TE(1,0) mode 
without breakdown if the dimensions of the waveguide are 5 cm x 5 cm. Assume 
the frequency is 2 x the cutoff frequency. 

14 . 8 . Consider a wave guide with a square cross section of dimensions a x a. Let the 
z axis be the axis of the wave guide. Suppose the region z < 0 is vacuum, and 
the region z > 0 is a dielectric with permittivity e. Write a solution of the wave 
equations and boundary conditions such that there is an incident and reflected wave 
for z <0 and a transmitted wave for z > 0. All three waves are TE(10) waves. 
Determine the transmitted power as a fraction of the incident power. [Answer: 
Strans/Sj n c = 4 kk'/(k + k') 2 , where k is the wave vector in the region z < 0 
and k' is the wave vector in the region z > 0.] 

14 . 9 . Determine the 10 TE modes of a wave guide with a square cross section with lowest 
cutoff frequencies. Also, what are the lowest 10 TM modes? 


Sec. 14.3. Wave Guide of Arbitrary Shape 

14.10. Prove that all four Maxwell equations are satisfied for the fields in (14.72) and 
(14.73) if f (x, y) is a solution of the 2D Helmholtz equation. (Hint: In calculating 
V ■ B, V x B, and V x E, be careful to take account of the factor e ‘(kz-a>t) ) 

14.11. Determine the TM and TE modes of propagation in a cylindrical wave guide with a 
circular cross section. (Hint: Solve (14.74) by separation of variables in plane polar 
coordinates. The radial functions are Bessel functions regular at r = 0.) Determine 
the first few cutoff frequencies of each type of mode, and note any degeneracies. 


Sec. 14.4. Coaxial Cable 

14 . 12 . Use the general results of Sec. 14.3 to derive the TEM mode of a coaxial cable. 
Start with (14.72) and (14.73), with y = 0. f(x, y) is a solution of Laplace’s 
equation with appropriate boundary conditions. 


General Exercises 

14 . 13 . Consider electromagnetic waves inside a perfectly conducting metal wave guide. 
Assume the interior cross section of the wave guide (parallel to the xy plane) is a 
square of size a x a, and the length of the wave guide along the z axis is infinite. 
For a particular set of TE modes, the electric field oscillates in the x direction 

E(x, 0 = i E(.y)e i{kz ~ , ° ,) 


where E{y) depends only on y. 
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(a) For a given value of k determine the possible values of co. 

(b) Sketch a graph of phase velocity as a function of k for the first few modes. 
Mark the speed of light in vacuum c on the velocity axis. 

(c) Sketch a graph of group velocity as a function of k for the first few modes. 
Mark c on the velocity axis. 



CHAPTER 



Radiation of 
Electromagnetic Waves 


In this final chapter we study the process of radiation. Radiation means the gen¬ 
eration of electromagnetic waves, which then propagate away from the source to 
infinity. Classically, electromagnetic waves are created by charged particles accel¬ 
erating, or, equivalently, currents varying in time. In Chapter 12 we determined 
the electric and magnetic fields of a point charge moving with constant velocity. 
There is no radiation in that case; the energy in the electromagnetic field is trans¬ 
ported along with the particle, but does not propagate away toward infinity. But if 
the charge velocity is not constant there must be radiation. 

As we start the study of radiation, it is interesting to remember thatthe classical 
field theory was developed from experiments and observations on macroscopic 
objects such as charged bodies, insulators, conductors, current-carrying wires, and 
magnets. The classical theory is therefore at its best when applied to macroscopic 
electromagnetic phenomena. So, for example, the theory applies to radiation of 
radio waves by an antenna, or microwaves by a cavity resonator. In this book we 
concentrate mainly on systems of that kind. 

However, much of the radiation around us—in fact much of the radiation in the 
Universe—comes from microscopic systems and must be described by quantum 
electrodynamics. For example, sunlight, light from the filament of an incandescent 
bulb, fluorescent light, or laser light can only be explained by quantum consider¬ 
ations of individual atoms, molecules, or systems of atoms. For these systems the 
classical theory does not really apply, except perhaps in a qualitative, heuristic 
way. Only quantum electrodynamics can properly describe radiation by an atom. 
Despite its limited applicability, the classical theory of radiation is an important 
part of electromagnetism. Quantum electrodynamics, which has limitations of its 
own, relies on a foundation of classical electrodynamics. 

A significant application of radiation theory is the analysis and design of anten¬ 
nas. A common radio antenna carries an alternating current. This time-dependent 
current, composed of charges with nonzero acceleration, creates electromagnetic 
waves. Some type of modulation of the alternating current encodes the informa¬ 
tion carried by the wave. In AM radio the amplitude is modulated, while in FM 
radio and in many cell phones the frequency is modulated. The receiver circuit 
decodes the modulation when the radio wave is picked up by the receiving an¬ 
tenna. 

In the first section of this chapter we construct the potential functions for arbi¬ 
trary time-dependent charge and current densities. In the later sections we use the 
general results to analyze some specific radiating systems. 
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15.1 ■ THE RETARDED POTENTIALS 

The behavior of electromagnetic fields interacting with charged particles is gov¬ 
erned by the Maxwell equations. It is sufficient for our purposes to consider fields 
in vacuum, i.e., systems without polarizable materials. Then the field equations 
are 


V . E = p/e 0 and V x E = —3B/3r (15.1) 

V • B = 0 and V x B = /to (J + e 0 3E/3r). (15.2) 

We simplify the mathematics by introducing vector and scalar potentials, A(x, t ) 
and V(\, t). Recall from Sec. 11.2 that the fields are written as 

B = V x A and E = —dA/dt — W. (15.3) 


Also, it is most convenient in radiation theory to impose upon A(x, t) and V (x, t) 
the Lorentz gauge condition, 


V A = - 


1 3V 
c 2 3 1 


(15.4) 


there is no loss of generality in making this requirement, because it can always 
be satisfied by a suitable gauge transformation if necessary. Following the usual 
notation c is the speed of electromagnetic waves in vacuum, which is equal to 
1 /V^ofo- With the Lorentz gauge choice, Maxwell’s equations imply that A(x, t ) 
and V (x, t) satisfy the differential equations 


2 1 d 2 V 

P 


V 2 y + 

c 2 3r 2 


(15.5) 

_ 1 3 2 A 



- V A + c 2 3r 2 

= MoJ- 

(15.6) 


We met these equations earlier as (11-30) and (11-31). The scalar equation (15.5) 
is called d’Alembert’s equation, the differential operator —V 2 + (l/c 2 )3 2 /3f 2 
is called the d’Alembertian. Each Cartesian component of A(x, t) also satisfies 
d’Alembert’s equation. We shall solve (15.5) for V (x, t ) and (15.6) for A(x, t) by 
constructing the Green’s function of the d’Alembertian. 


15.1.1 ■ Green's Functions 

Our goal is to develop a formula that solves (15.5) for V(x, t) for an arbitrary 
source function p(x, t). The same formula will apply to (15.6). To construct the 
solution we shall analyze (15.5) in several steps. To begin, suppose the charge 
density is constant in time, p = p(x). Then V (x) is independent of time, assum¬ 
ing p(x) is the only source of the potential, and so (15.5) reduces to Poisson’s 
equation 


—V 2 V = p/6 o. 


( 15 . 7 ) 
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In Chapter 3 we studied Green’s solution of Poisson’s equation, 


V(x) = 


i r p(\')d 3 x' 
eo J 47r|x - x'| 


(static case). 


(15.8) 


This formula solves Poisson’s equation because 1 /4nr is the Green’s function 
of —V 2 ; that is, in terms of the Dirac delta function 


47r|x 


= <5 3 (x 


•*'). 


(15.9) 


When —V 2 acts on the integral in (15.8) it gives f <5 3 (x — x!)p(x')d i x ', which 
is p(x) because the delta function is an infinitely sharp spike at x' = x. Hence 
(15.7) is satisfied. In other words, l/(4ttr) is the potential (aside from the con¬ 
stant factor 1/eo) at distance r from a unit point charge. The integral in (15.8) 
is the superposition of the potentials due to elements of the charge distribution, 
integrated over the charge distribution. 

As the next step toward solving (15.5), suppose the charge density is harmonic 
in time with constant angular frequency co. That is, let p(x, t) have the form 


p(x, t) = p(x)e 


(15.10) 


As usual we use complex fields and densities for mathematical convenience, but 
it is understood that the real part of a complex function is the physical quantity. If 
p(x, t) is harmonic for all times then so is V (x, t), and we may write 

V(x, t) = V(x)e~ iwl . (15.11) 


Now we substitute these forms for V and p into d’Alembert’s equation. Note that 
the operator 3/3 1 acting on V (x, t) is equivalent to multiplying by —iw. Thus we 
find that V (x) must obey 

-V 2 V - k 2 V = p/e 0 , (15.12) 

where the scalar parameter k is co/c. Equation (15.12) is the Helmholtz equa¬ 
tion , and it, too, can be solved by the Green’s function method. We just need the 
Green’s function of the operator —(V 2 + k 2 ). 

We shall show that the Green’s function of —(V 2 + k 2 ) is e' kr /(4nr). That is, 

— (v 2 + k 2 ^ - = 47t<5 3 (x — x'), (15.13) 

where r = |x — x'|. Note that if k = 0 this reduces to the static case. To prove 
(15.13) we must verify that the left-hand side has the properties of the delta 
function. Without loss of generality let x' be at the origin. We must show that 
the left-hand side of (15.13) is 0 for r ^ 0 but has integral 47r over a volume 
that includes the origin. It is straightforward to show, in spherical coordinates, 
V 2 (e‘ kr /r) = —k 2 (e' kr /r) for r ^ 0; so the left-hand side of (15.13) is 0 away 
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from the singular point. But by Gauss’s theorem the integral of the left-hand side 
over a spherical volume of radius R around r = 0 is 


d 

|V‘* 1 

47 1R 2 — k 2 4jt 

' -1 r ' 

O “I" 

e ikr 

dR 

R 


_(ik) 2 ik _ 



= 47T. 


The integral is 47T for any R. Thus the left-hand side of (15.13) does have exactly 
the properties of a delta function, and the equation is proven. Using the Green’s 
function, V(\) is 


V(x) = 



e ik\x-x'\ 

4tt|x-x / | 


p(x!)d 2 x l . 


(15.14) 


The operator —(V 2 + k 2 ) acting on the integral gives p(x) because the delta func¬ 
tion is an infinitely sharp spike at x' = x. 

The formula (15.14) solves the potential problem for a source harmonic in 
time. There are many interesting problems with harmonic fields. Equation (15.14) 
and its partner for A(x) may be used to solve such problems. But now we are 
interested in general time-dependence. 

As the final step in the analysis of d’Alembert’s equation, let p(x, t) have ar¬ 
bitrary time dependence. For any time dependence we may express p(x, t) and 
U(x, t) as Fourier integrals, i.e., as superpositions of harmonic functions: 


P(x, t) = 

POO 

/ p(x, co)e~ ic0t dco. 

(15.15) 


/—OO 



POO _ 


V(x,t) = 

/ V(x, co)e~ l(Dt d(o. 

1— OO 

(15.16) 


The fact that any time-dependent function can be written as a Fourier integral 
(with reasonable assumptions about smoothness and finiteness that would be true 
for any physical field) is a powerful theorem of mathematics. Now, by the super¬ 
position principle—applicable here because (15.5) is a linear relation between V 
and p—the component of V (x, t) with frequency co is determined by the com¬ 
ponent of p(x, t ) with the same frequency. That is, V (x, co) is given in terms 
of p(x, co) by (15.14) with k = co/c. Substituting this solution for V(x, co) into 
(15.16) we have 


1 f 00 C e ,a, l x_x, l/ c 

V(x, t) - — / / —- -p(x',co)d 3 x'e~ t<ot dco. (15.17) 

eod-ooJ 47 t|x-x , | 

The co integration in (15.17) can be evaluated. Combine the exponentials e~ ,wt 
and e llJ,r l c as e~ lmt ' where 


t' — t - |x - x'|/c. 


(15.18) 
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Now the co integral can be done by (15.15), giving p(x', t'). Therefore the scalar 
potential is 


V(x, t) = — 
eo . 


- Ix-x'l /c) d 3 x , 

47t|x-x , | * ' 


(15.19) 


This is our general formula for the scalar potential due to a time-dependent charge 
density in the Lorentz gauge. d’Alembert’s equation for the vector potential is 
solved in the same way, by 


A(x, r) = mo 


J(x', t - Ix-x'l/c) ^, 
4tt|x-x'| ‘ ' 


(15.20) 


The potentials (15.19) and (15.20) are called the retarded potentials. 

The meaning of (15.19) or (15.20) is that a potential at point x and time t is a 
superposition of terms, integrated over all source points x'. However, the contri¬ 
bution to the potential from the element of the source at position x', is determined 
by the density p(x', t') or J(x', ?') not at time t but at the earlier time t' given 
by t — r/c. This time t' is called the retarded time. The time difference t — t' is 
equal to the time it would take light to travel from x' to x. In other words, there 
is a time delay, equal to the light travel time, for any action at the source point x' 
to affect the field at x. Field theory is a local theory—the antithesis of action at 
a distance—and it takes time for a change at one point in the system to influence 
the system at another point. The effect of a change in the charge or current density 
travels at the speed of light. 

The idea of the retarded potential was first stated by Riemann for scalar waves. 
It was first applied to electromagnetism by Ludwig Lorenz. By the way, (15.19) 
and (15.20) imply that the Green’s function G(r, r) of the d’Alembertian opera¬ 
tor, defined by 


-V 2 G + 


1 9 2 G 
c 2 3r 2 


= S 3 (r)3(r), 


(15.21) 


is 


G(r, r) = 


<5( r — r/c) 
47rr 


In terms of this time-dependent Green’s function, the potential is 
V(x, f) = — [ G(x - x', t - t')p(\', t')d 3 x'dt'. 

J 


(15.22) 


(15.23) 


As an illustration of the use of retarded potentials, we consider the following 
interesting, although unrealistic, example. 


EXAMPLE 1 Suppose at t = 0 a uniform surface current K(f) = Ko(t) i is sud¬ 
denly established on the entire xy plane. Figure 15.1 illustrates the source and 
waves. What are the radiated wave fields E(x, t ) and B(x, t)1 
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wave propagation 



FIGURE 15.1 Example 1. The xy plane carries a time-dependent surface current density 
K(r) in the x direction. Waves are radiated in the ±z directions. 


First, note that V (x, t) = 0 because there is no net charge. Second, calculate 
the vector potential by replacing J d^x' in (15.20) by K dS' 

A(x, t) = — f -K(x, t - r/c)dS'. (15.24) 

4 7i J r 

Here r denotes the distance |x — x'| from source point to field point. The integral 
may be evaluated by integrating over infinitesimal annuli of current, as shown in 
Fig. 15.2. For a point P at distance z from the plane of current 


z 



FIGURE 15.2 An integration element on a current sheet. The xy plane has surface cur¬ 
rent density K in the x direction. The shaded region is an elemental ring. 
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A(x, t) = — i [ -Ko(t — r/c)2nr'dr', (15.25) 

4 tv J r 


where r 2 = r' 1 + z 2 . 

Now change the variable of integration from r' to r, using r'dr' = rdr. The 
lower limit of r is z for z > 0, or — z for z < 0, therefore we write |z|. The upper 
limit of r is ct because Knit — r/c) is 0 for t — r/c < 0. Thus 

A(x, t) = —i [ K 0 (t - r/c)dr. (15.26) 

2 J\Z I 

The effect of a change of the current propagates at the speed of light. For t < \z\/c 
there is no field at P. At t = |zl/c the field due to the nearest source, at the origin, 
arrives at P. For t > |z|/c all sources with r < ct contribute to the field at P. 

To facilitate the calculation of E and B we make another change of variables, 
to £ = t — r/c , 


The fields at P are 


A(x, t) 


MOc; f 

2 1 Jo 


t-\z\/c 


K o(|)d|. 


oa line 

E(x, t) = —— = ——^~Ko(t T z/c)\ 
at 2 

a 3 A y Ho * 

B(x, t) = V x A = j—— = K()(t T z / c) J 

dz 2 


(15.27) 


(15.28) 

(15.29) 


where the upper sign is for z > 0 and the lower sign is for z < 0. These fields are 
illustrated in Fig. 15.1. 

As an exercise it can be shown that (15.28) and (15.29) represent outgoing 
plane waves, and that the proper boundary conditions are satisfied. 1 


One final comment should be made about the retarded potentials. There are 
also advanced potentials, for which the potential at the spacetime point (x, t ) de¬ 
pends on the source at (x', t"), where t" = t + |x — x'|/c. Note that t" is a later 
time than f; the time t" is called the advanced time. Going back through our 
derivation of (15.19) and (15.20) we would generate the advanced potentials by 
using e~ lkr /(‘Xnr) as the Green’s function of —(V 2 + k 2 ). In (15.14) we used 
e + lkr /(Anr), an outgoing spherical wave, as the effect of the harmonic source; 
whereas e~ lkr /(Anr) describes an incoming spherical wave. Either retarded or 
advanced potentials satisfy the differential equations (15.5) and (15.6). But the 
retarded potentials satisfy the proper boundary condition in time for radiation 
problems: Causes must precede their effects, so the response to a change in the 

1 See Exercise 1. 
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source occurs later, at the retarded time. The advanced potentials are unphysi¬ 
cal solutions, the use of which would imply a violation of causality. Nevertheless 
there are theoretical speculations about electromagnetism in which advanced po¬ 
tentials play a role. 2 


15.2 ■ RADIATION FROM AN ELECTRIC DIPOLE 

Equations (15.19) and (15.20) determine the potentials, and hence the fields, for 
any densities p(x, t) and J(x, t). At this point we may specify a system of cur¬ 
rent and charge and calculate the fields. At large distances from the source the 
asymptotic fields will be electromagnetic waves, propagating outward with some 
angular distribution of intensity. Such a calculation is an example in antenna the¬ 
ory. Whole books are devoted to this important topic in electrical engineering. 

Antenna calculations are somewhat technical and specialized. In this section 
we study the simplest example—radiation from a pointlike electric dipole. Al¬ 
though this case has limited applicability, it is a good pedagogical example be¬ 
cause it is simple enough that the field calculation can be done analytically, but 
complete enough to illustrate the general features of radiation problems. 

In general a source of electromagnetic radiation has both charge density p(x, t) 
and current density J(x, t). It turns out to be simplest to analyze the current den¬ 
sity. J(x', t') denotes the volume current density as a function of position x' in 
the source and time t'. We shall assume that the source occupies a finite vol¬ 
ume around the origin, as shown in Fig. 15.3. Then we shall calculate the asymp¬ 
totic potentials and fields, i.e., at points far from the source. As we shall see, the 
asymptotic fields are of order 1/r where r = |x| is the distance from the origin. 
Therefore we systematically neglect terms of order 1/r 2 or smaller in calculating 


z 



FIGURE 15.3 Asymptotics. Far f rom the source |x — x , | approaches r. A better approx¬ 
imation is |x — x ' « r — ? • x 

2 J. A. Wheeler and R. P. Feynman, Rev. Mod. Phys. 17, 157 (1945), and 21, 425 (1949); J. L. Anderson, 
Am. J. Phys. 60, 465 (1992). 
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the fields. It is characteristic of radiation fields that they fall off as 1/r at large 
distances. The asymptotic region is called the radiation zone. 

The exact vector potential for any field pointx is (15.20). Asymptotically, |x| is 
much larger than |x'| for any source point x'. Therefore |x—x'| approaches r = |x|, 
the distance to the origin, as demonstrated in Fig. 15.3. This approximation is 
accurate enough to determine the asymptotic fields of a pointlike electric dipole. 3 
Thus we start with the relation 

A(x, t) ~ f J(x ; , t — r/c)d^x', (15.30) 

4 nr J 

where the symbol is to be read as “approaches asymptotically”. 4 Again, r = 

|x| is the distance from the origin to the field point x, which is to say the radial 
coordinate of x. To simplify the notation, let t r denote the retarded time for the 
source point at the origin 


t r = t — r/c. (15.31) 

In the approximation (15.30) all source points have the same retarded time t r , 
which is what we mean by a pointlike source. For this approximation to be valid 
the source size a must be small compared to any relevant length in the system. For 
one thing, a must be small compared to r. But more importantly, a must also be 
small compared to c t, where r is the characteristic time constant of the variation 
of J(x', t'). For example, if J(x', t') varies harmonically in time, with angular 
frequency co, then the characteristic time constant r would be the period 2tt/(o. 
In that case, because c r would be the wavelength A of the radiation, we require 
a <JC A. This statement makes precise what we mean by a pointlike source. Or, 
looking at it another way, the condition for validity of (15.30) is that the changes 
occur slowly, with a long characteristic time r a/c. 

Having made the approximation (15.30) the rest of the calculation is just a 
matter of simplifying the expression. First we relate A(x, t) to the electric dipole 
moment of the source. Recall from Chapter 3 the definition of the electric di¬ 
pole moment of a charge distribution, 

p(r) = J xp(x, t)d 3 x, (15.32) 

which we assume is nonzero. (We temporarily drop the prime on the source points 
to simplify the notation; for now x denotes a point in the source.) We shall prove 
that / J d^x is dp/dt. 

Consider the integral, over all space, of V ■ (x, J) where i is a Cartesian index. 5 
By Gauss’s theorem the integral is 0, because J(x, t) is 0 on the sphere at oo. The 
divergence is 7/ + x;V • J by a vector identity; and V • J is — dp/dt by charge 

3 We will use a better approximation to calculate magnetic dipole radiation in Sec. 15.2.3. 

4 The mathematics of asymptotic expansion is used often in radiation theory. We approximate functions 
of x in the limit r —> oo, keeping terms of order r -1 but neglecting terms of order r~ 2 . 

5 The index values i — 1, 2, 3 correspond to the x, y,z components, as usual. 
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conservation. Thus the all-space integral of the current density is 

J Jjd^x — — j Xj V • J d 3 x — j Xipdr’x = (15.33) 

as claimed. Using this result in (15.30) the asymptotic potential is 

A(x, t) ~ (15.34) 

4nr dt 

where the derivative of p is evaluated at the retarded time t r . 

The magnetic field is V x A. Computing the curl of (15.34) gives two terms, 
but one is negligible asymptotically. If V acts on the factor 1/r the result is of 
order 1/r 2 , which we must neglect as before. But there is also an r dependence 
in dp/dt because this function is evaluated at t r = t — r/c. The curl of dp/dt is 
Vf r x d 2 p/dt 2 by the chain rule; and Vt r is —r/c. Thus the asymptotic magnetic 
field is 


B(x, t) ~ —— f x = Bf ad- (15.35) 

4jtrc dt 2 

The notation B rad emphasizes that the asymptotic field is the electromagnetic ra¬ 
diation. 

The asymptotic scalar potential may be determined from the Lorentz gauge 
condition (15.4), 


dv 

JT 


— —c 2 V • A ~ 


c 2 ixq d 2 p / -r \ 
4itr dt 2 \ c ) 


(15.36) 


The asymptotic part of V • A comes from V acting on the r dependence of 
p(f r ), i.e., 


v dpM 

dt 


d 2 p 
~dt 2 




By (15.36) the asymptotic potential is 


(15.37) 


Mo c* dp 

-r • —. 

4nr dt 


(15.38) 


The electric field is —3A/3 1 — VV. We could calculate the asymptotic form 
directly, which we leave as an exercise. 6 However, the result may be derived more 
easily by using properties of electromagnetic waves. Far from the source, E and 
B propagate as an outgoing spherical wave. In any direction the wave approaches 
asymptotically a plane wave. Therefore E ra( j, B rac j, and f form an orthogonal triad 
of vectors; and the magnitudes of the electric and magnetic fields are related by 
B rad = £ rad /c, as for plane waves. These properties of the outgoing waves follow 

6 See Exercise 4. 
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directly from Maxwell’s equations in the vacuum outside the source. Therefore 
the asymptotic electric field E rac j is cB ra d x r. Substituting (15.35), and simplifying 
the double cross product, we find the electric field of the outgoing radiation 


E rad = 

4 nr 


r r 


d -p 

di 2 


d 2 p 

di 2 


(15.39) 


Again, d 2 p/dt 2 is evaluated at the retarded time t r —t — r/c. 

It is important to note that E ra d and B ra d are of order 1 / r. Indeed we have ne¬ 
glected any other r dependence. Static sources always produce fields that decrease 
with a higher power of 1 / r. For example, the electric field of a static dipole is of 
order p/ (tor 3 ). But the electric field of the radiation produced by a time-varying 
dipole is of order p/(eoc 2 r). It is also important to understand the implications of 
this asymptotic behavior. The fields decrease as 1/r, so the energy flux decreases 
as 1/r 2 , as required by conservation of energy. 

The energy flux S (power per unit area) in the radiation zone is E ra d x B ra d//i n- 
The calculation of E ra d x B ra d requires several steps. B ra d is proportional tor xa, 
where a = d 2 p/dt 2 , and E ra d is proportional to ra r — a. The cross-product 
reduction proceeds via vector product identities as follows: 


(ra r — a) x (r x a) = a r r x (r x a) — a x (? x a) 
= a r [far — a] — ra 2 + a r a 
= r (a 2 - a 2 ^ . 


Therefore 


M 0? / ^ 2 p \ 2 

16^ 2 r 2 c ydt 2 J 



(15.40) 


The energy flux is in the direction of r, i.e., radially outward. The magnitude 
includes the factor 1/r 2 as it must by energy conservation: The power through a 
sphere cannot depend on the radius if the radiation rate is constant. Or, whatever 
power passes a radius r\ at time t\ passes any larger radius ri at ti — t\ + (ri — 
r\)/c , carried outward by the fields of the propagating wave. The surface area is 
Airr 2 so the power per unit area must decrease as 1/r 2 . 

The total power of radiation passing through a sphere of radius R is 

P(R, t) = <j)S-rR 2 d£2. 

To evaluate the integral over solid angles, set up a coordinate system with the z 
direction parallel to drpjdt 2 and use polar coordinates; note that f dO. = 47r and 
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P(R,t) 


1 

67TC0 C 3 


d 2 p 
dt 2 


(15.41) 


It is worth stating again that d 2 p/dt 2 in (15.40) and (15.41) is evaluated at t — R/c, 
the time when the radiation was emitted from the origin. So P(R, t) is the power 
at radius R of the radiation that was generated at the retarded time. 

The results (15.40) and (15.41) are general for any pointlike dipole mo¬ 
ment p(f). For example, if the dipole changes only during a short time interval, 
then the radiation is a pulse propagating outward. Another example is continuous 
harmonic radiation, which we consider next. 


15.2.1 ■ The Hertzian Dipole 

As a simple example of a radiation source, consider the harmonically oscillating 
dipole shown in Fig. 15.4, called the Hertzian dipole. An alternating current I (t) 
carries charge between two small spheres on the z axis at z = ±d/2. The charges 
on the two spheres are ±<2(0, where Q(t) = Qq cos <ot. The current in the wire 
connecting the spheres is I = dQ/dt — —toQo sin cot. Thus the electric dipole 
moment, as a function of time t, is 

p(0 = kQ(t)d = kpocos cot (15.42) 

where po = Qod. The derivatives of p, which we need to calculate the radiation 
fields, are 


dp/dt = —kpowsincu? 
d 2 p/dt 2 = —kpoco 2 cos cot. 

A physical realization of the Hertzian dipole is a small parallel plate capacitor in 
a circuit with an alternating emf. 

The electromagnetic waves from the Hertzian dipole have frequency a> and 
wavelength X = 2ttc/o:. We assume X » d so that the dipole is pointlike. 



FIGURE 15.4 The Hertzian dipole. The size of the dipole is assumed to be much 
smaller than one wavelength of the radiation. 
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Then the radiated power is (15.41) and its angular distribution can be calculated 
from (15.40). 

Let r, 6, (p be spherical polar coordinates, with the origin at the dipole. The 
radiation zone for this small dipole is the region r » A. The radiation fields at 
(r, <9, <p) are, by (15.35) and (15.39), 


where 


^rad — Brad4> and Erad — cBx^dO (15.43) 

— LinpnCO 2 Sin0 r . . 

fi rad = -:-COS [u>{t - r/c )]. (15.44) 

4:rc r 


The magnetic field oscillates in the azimuthal direction, and the electric field oscil¬ 
lates in the meridional direction, as illustrated in Fig. 15.5. The Poynting vector is 



MO 


(15.45) 


which is in the radial direction. The energy flux at a point in the radiation zone 
oscillates between 0 and a maximum value .S’ max , with frequency 2 oj. The radi¬ 
ation intensity 2 is defined as the average energy flux—averaged over a period 
of oscillation—so the intensity is I = S nydx /2. The differential power, i.e., the 
average power per unit solid angle, is 


dP avg 

di2 


— r r ■ S max /2 — 


2 4 

32tt 2 eoc 3 


sin 2 6. 


(15.46) 



FIGURE 15.5 Field directions of a spherical wave radiated by a small electric dipole at 
the origin. The Poynting vector is radially outward. One half cycle later the directions of E 
and B at the point shown are opposite to the directions shown. 
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(We have substituted /xq = l/(eo c 2 ).) An important result is that the radiation 
intensity is proportional to cw 4 . If the frequency is doubled, the intensity increases 
by a factor of 16. 

The total power passing through a sphere in the radiation zone is the integral of 
Sr over the surface, or equivalently the integral of d P/d.£l overall directions. The 
relevant solid-angle integral is (f sin 2 9d£2 = 8zr/3. Thus the integrated power, 
averaged over a period of oscillation, is 


p = Pi 02 ' 

3Vg 127TC0C 3 


(15.47) 


The result is independent of the radius of the sphere, as expected. 

The angular distribution of the radiation is proportional to sin 2 6, i.e., 

^_Pavg _ 3/avg i 20 ( 15 . 4 8) 

d£2 8tt 

The intensity is independent of the azimuthal angle, as it must be by symmetry. 
Figure 15.6 shows the sin 2 6 distribution in the form of a polar plot. On this plot 
the distance from the origin to a point on the curve is proportional to the intensity 
of radiation in that direction. (6 is the angle with the z axis.) The sin 2 6 distribution 
is characteristic of electric dipole radiation. The intensity is greatest in the plane 
perpendicular to the oscillating moment (9 = n/2)\ and the intensity is 0 in 
the direction of motion of the oscillating charge (9 = 0 or tt). Electromagnetic 
waves are transverse ; that is, the fields oscillate in directions orthogonal to the 
direction of propagation. Current, alternating up and down on the z axis as shown 
in Fig. 15.4, and the varying charge on the spheres, create radiation fields with 
greatest magnitude in the xy plane, and with zero magnitude in the longitudinal 
direction z. 

Electrical engineers conventionally express the power characteristics of an an¬ 
tenna in terms of a quantity called the radiation resistance, or antenna resistance. 
The radiation fields are linear functions of the current /, so the average power 
is proportional to 7^, the mean squared current. For a current 7o sinoj; the rms 


z 



FIGURE 15.6 Angular distribution of the intensity of electric dipole radiation, as a 
polar plot. The distance on the graph from the origin to a point on the curve is proportional 
to the intensity in that direction. 
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current is /o/V2. The antenna resistance is defined by 

Rant = (15.49) 

That is, /^ ns /?ant is the average power radiated by the antenna. (Recall that the 
average power dissipated in a resistor R is I^R.) Rant has units of ohms. The 
radiation resistance of an antenna depends on its geometry, and is a measure of its 
efficiency as a radiator. 

For the Hertzian dipole the power is (15.47) and the rms current is Irms = 
Io/V2 = w<2o/V2, where Qo = po/d. Therefore the antenna resistance is 

(o 2 d 2 1 la 0 9 

Rant = 7 -T = 7-J— (*<*) ’ (15-50) 

6rreoc J 67 tv eo 

where k in the second equality is co/c. The quantity yVoAo — 377 Q is called 
the impedance of free space. Numerically, we calculate 

Rant = (20 Q) (kdj 1 . (15.51) 


The Hertzian dipole is small; we required d A to justify the pointlike source 
approximation leading to (15.47). Therefore kd <K 1 because k = 2rr/A. The 
radiation resistance is small, so a small dipole is not an efficient radiator. For 
example, if d = 0.1 /k then R am is only 0.2 Q. By comparison, the radiation 
resistance of a half-wave antenna is 73 £1 The half-wave antenna has d = A/2, 
and is not accurately described by the pointlike source approximation. For any 
antenna the radiation resistance can be written as times a dimensionless 

factor that is determined by the geometry of the source. 


15.2.2 ■ Atomic Transitions 

Radiation by an atom is quite different from radiation by a macroscopic source. 
An atom in an excited state emits a single photon and jumps to a state of lower 
energy. The quantum process is discrete and probabilistic. We cannot say ex¬ 
actly when the transition will occur, but we can calculate the transition rate 
(= transition probability per unit time) in terms of a transition matrix element. 

For a hydrogen atom, if the transition satisfies the electric-dipole selection 
rules then the relevant transition element is the dipole moment operator ex be¬ 
tween the inital and final atomic states |i) and \f). (x is the electron position with 
respect to the nucleus.) The matrix element may be written as 

(/Mi) = ed/ik 

letting the z axis be in the direction of (/|x|i). Then the transition rate per unit 
solid angle of the emitted photon direction, calculated in quantum theory, is 

( dR\ e 2 d 2 fi co 3 

transition rate = I — =- 7 sin 9, 

\dnj qu (47re 0 )27rfic3 
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where 9 is the angle between the z axis and the photon direction. The photon has 
energy fko so the mean power radiated per unit solid angle is 

(dR \ e 2 d 2 fi u 4 

power radiated = nco I —— 1 = —^ T sin 0. 

V \dS2j qu 8rr 2 e 0 c 3 

This quantum result has the same form as the classical power (15.46) for a dipole 
moment po = led ft. 

The mean energy flux, as a function of direction, is the same in both theories, 
with a suitable matching of the dipole moments. But the nature of the radiation is 
very different in the two theories. It is discrete in quantum theory but continuous in 
classical theory. The radiation from an atomic transition, if observed in a detector, 
is a single packet of energy fuo with a definite direction. In contrast, the radiation 
from a macroscopic antenna arrives at a detector continuously and spreads out 
as a wave moving in all directions from the source. A single photon also has an 
associated wave—a probability wave —but it cannot be detected simultaneously 
at different points like a classical wave. The radiation from a macroscopic an¬ 
tenna is a state with a large and indeterminate number of photons streaming in all 
directions, and this state behaves just like the classical Maxwellian theory. 

The quantum theory of atomic transitions is an important part of electromag¬ 
netism but outside the scope of this book. We return now to the classical theory 
of radiation. 

15.2.3 ■ Magnetic Dipole Radiation 

The radiation fields in (15.35) and (15.39) were derived in a general way, and are 
the dominant fields for a small source (a <?C A) unless the electric dipole moment 
happens to be 0 because of some symmetry of the source. If pit) = 0 (for all t) 
then the dominant radiation may be magnetic dipole, electric quadrupole, or some 
higher multipole, depending on the geometry of the source. For example, consider 
a current loop in the xy plane, with alternating current / (t) = Ioe~ ,cot , as shown 
in Fig. 15.7. This source has zero electric dipole moment, and indeed zero charge 
density. But it radiates because of the varying current. The dominant radiation, if 
the current loop is small compared to A = Inc/co, is magnetic dipole radiation. 
The magnetic moment is m(r) = kIoAe~‘ Ml , where A is the area of the loop. 

The vector potential can be calculated from (15.20). For simplicity we will 
only calculate the fields in the radiation zone. Asymptotically we may approxi¬ 
mate 1 /|x — x'| by \/r, where r = |x| is the radial coordinate of the field point. 
However, we need to be more accurate in evaluating the retarded time. A better 
approximation of |x — x'| is r — x' • ?; then by Taylor series expansion 

J(x', t - |x - x\/c) as J(x\ t r ) + -—- J(x\ tr). (15.52) 

C 3 t r 

Again t r is t — r/c, the retarded time of a source at the origin. In the second term 
of (15.52) 3/3 t r may be replaced by —ia>. To use (15.20) we need the integral of 
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FIGURE 15.7 Magnetic dipole radiator. The magnetic moment is I Ak, where A is the 
area of the planar loop. If I ( t ) oscillates harmonically then electromagnetic waves with the 
same frequency are generated. 


J(x', t') over all x'. The integral of the first term is 0 by (15.33), because p(x, t) = 
0. (This is why we needed a more precise retarded time; the lowest approximation 
is 0.) The integral of the second term is (—ico/c )m x f, by (8.72), which was 
derived from the definition (8.73) of the magnetic dipole moment of a current 
distribution. Thus the asymptotic vector potential is 


A(x, t) ~-m(r r ) x ?. ' (15.53) 

Anrc 

It is convenient to define the xy plane as the plane of the current loop; then using 
polar coordinates 


A(x, t) 


—HoiwloA sin0 


- ia>(t~r/c V 


(15.54) 


The asymptotic vector potential formagnetic dipole radiation is oc mo/c, while 
for electric dipole radiation it is oc po- In radiation from atoms, magnetic dipole 
radiation is weaker than electric dipole radiation because typically mo/c is small 
compared to po. 

The electric field is E = —3A/3 1 = icoA, because V = 0. Thus E ra d is 
azimuthal. In the radiation zone the magnetic field is orthogonal to E and ?, with 
magnitude E/c, so B = r x E/c. Thus B r ad is meridional. The directions of E ra d 
and B r ad are interchanged relative to radiation from an electric dipole. We leave 
it as an exercise 7 to show that the magnetic dipole results are equivalent to the 
following dual transformation of the electric dipole results: 


P0 —* tn$/c, E r ad ~► cB r ad. Brad ~^ E ra d/c, (15.55) 


where mo = hA is the amplitude of oscillation of the magnetic moment. That is, 
starting from the equations for the electric dipole fields, and making the replace¬ 
ments (15.55), yields the magnetic dipole fields. 

7 See Exercise 7. 
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The mean power radiated by the magnetic dipole is 

Pavg = 12 ^’ 


(15.56) 


which is just the dual transformation of the power (15.47) from an oscillating 
electric dipole. The dipole moment for a current loop is mo = Jo A, so the power 
may be written as 


I 2 A 2 k* 

^ aVg ]2jTtE()C ’ 


(15.57) 


where k = to/c. The antenna resistance for an oscillating pointlike magnetic 
dipole is 


R ml = 1- (k 2 a) 2 = (20^2) (k 2 A) 2 . (15.58) 

6 tt V e o ' ' v > 


Let a be the linear dimension of the loop, so that A ex a 2 . Then f? ant is propor¬ 
tional to ( ka ) 4 , whereas for electric dipole radiation /? am is proportional to (ka) 2 . 
Thus, since ka 4 Cl for small sources, radiation from a magnetic dipole is much 
less intense than radiation from an electric dipole, assuming the dimensions and 
currents are comparable. A related result in atomic physics is that magnetic dipole 
transition rates are much smaller than electric dipole. 8 


15.2.4 ■ Complete Fields of a Hertzian Dipole 

In Sec. 15.2.1 we calculated the radiation fields of a Hertzian dipole, i.e., a small 
electric dipole whose direction oscillates on the z axis. These are the asymptotic 
fields propagating away from the dipole. Now we will calculate the complete 
fields, for all positions in space. By small dipole we mean that the size d is much 
less than the wavelength of the radiation, which is the only other scale in the 
problem. 

Our starting point is the exact retarded potential (15.20). Because the source is 
pointlike this reduces to 


a / \ MO dp 

A(x, t) = -— — 
4nr dt 


as in (15.34), where p is the electric dipole moment at the retarded time t — r/c. 
This is the vector potential that we used earlier to calculate the asymptotic fields, 
i.e.,forr » X. Butherewe will not assume that r is large com pared to A.. We just 
assume that r is much larger than the dimensions of the source. 

Let the dipole moment point in the z direction, so that p(?) = p(t)k. Then 


A(x, t ) = - r/c) k, 

4 nr 


See Exercise 9. 
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where p' denotes the derivative of p with respect to its argument. Note that A is 
in the z direction, A 7 k. The magnetic field is 


B = V x A = VA, x k = — 
z 4n 



sin 0 fi. 


(15.59) 


In evaluating (15.59) use spherical coordinates to calculate VA Z , and note that 
? x k = — sin 0 cf). The magnetic field is everywhere azimuthal, curling around 
the dipole axis. 

The electric field can be calculated most easily from the field equation 


3E , 

— =c 2 V x B, 
3 1 


which holds in the vacuum outside the dipole source. Because B = is az¬ 
imuthal the curl in polar coordinates gives 


3E 
3 1 


f 3 , .03. . 

—:-(Brf, sin 0)- (rBA 

r sin0 30 ' ^ ' r 3r V 


IM)C 

4 n 


-[?2cos0(^ + ^ I )+0sin0( / ^ + ^ T + 4-) • 

\r J cr z / cr z c^r) 


This can immediately be integrated in t, and we find the electric field everywhere, 
r.2 r. 


E = 


Mo c 

4 n 


'a, inf. (4 + 4 + 4) +rco s « (4 + 4)1 . (15.60) 

\r J cr l c^r) \r i cr*-) 


Near, intermediate, and far fields. If we examine (15.59) and (15.60) we see 
three regions. For large r the dominant terms are of order p"/r. These are the 
asymptotic radiation fields propagating away from the origin, that we calculated 
before, also called the far fields. They have a relatively simple form: Both fields 
are transverse to the direction of propagation, B in the <fi direction and E in the 0 
direction, with B = E/c. The other limit is small r, where the dominant field is E, 
which is of order p/r 3 . This is called the near field, and it, too, has a simple form. 
This field is precisely the electrostatic field of a point dipole (see (3-101)) with 
instantaneous dipole moment p(t) k. The near field is also a good approximation 
for all r if the dipole moment changes very slowly, so that p' and p" are small, 
which is an example of the quasistatic approximation. Both near and far fields 
have simple forms, but the intermediate field, which interpolates from one to the 
other, is more complicated. 


EXAMPLE 2 A harmonically oscillating dipole moment. Suppose now that 
pit) = poe~ lcot . What are the fields? 
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By our previous results the magnetic field is where 


#0 


MoPo —ia> 
An r 2 



sinPe 


—ia>(t—r/c). 


(15.61) 


and the electric field is EgO + E r r, where 


Eg 


E r 


M0P0C 2 

’ 1 

ico 

CO 2 ' 

sin e e - i(0(t 

An 

r 3 

cr 2 

c 2 r 


HoPoc 2 

4jz 

"2 2 iw" 

r 3 cr 2 

cos 9e 

—iio(t—r/c) 


(15.62) 

(15.63) 


The phase of the oscillation in time is a complicated function of r. It depends on r 
because of the retardation, and because of the factors of i = e' 71 ! 2 in the terms of 
order \/r 2 . Also, we can be more precise about the validity of the near and far 
field approximations in this example. The near zone is the region with r <SC c/oj; 
or, since the wavelength is A. = Inc/co, the near zone is r <K A. Similarly the far 
zone is the region r 2> A.. 

In the far zone E and B are orthogonal, as for a plane wave. There the Poynt- 
ing vector is always radially outward. In the intermediate zone E and B are not 
orthogonal and the Poynting vector has an oscillating component in the 9 direc¬ 
tion, corresponding to field energy moving back and forth along meridians. 

Although the complete fields are complicated they satisfy th t free field equa¬ 
tions. The dipole is pointlike, so there is no charge or current for r > 0. In fact, 
we have previously derived the solution (15.61) to (15.63) from the vacuum field 
equations. 9 


15.3 ■ THE HALF-WAVE LINEAR ANTENNA 

Many radio transmitters are linear antennas with length t equal to a multiple of 
A/2, where A is the wavelength of the radiation. In this section we study the radi¬ 
ation fields and power from a half-wave antenna, t = A/2. Figure 15.8(a) shows 
the antenna schematically. Alternating current /o sin cot feeds into the center of the 
antenna from a transmission line. (There is no radiation from the transmission line 
because its two currents cancel.) Let the z axis be along the antenna, with the cen¬ 
ter of the antenna at z = 0. Let I (z, t) be the current at distance z from the center 
and time t. At z = 0 the antenna current is Zo sin o:t, oscillating harmonically 
between Zo and — Zo. At the ends z = ±f/2 the current must be 0, because there 
is nowhere for the charges to go to or come from. 

9 See Exercise 8. 
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FIGURE 15.8 The half -wave linear antenna, (a) Alternating current is fed into the 
center of the antenna by a transmission line, (b) The electric current I(z, t) of (15.64) as a 
function of z for several values of t. The length of the antenna is A/2. 



A reasonable approximation of the antenna current as a function of z and t is 

/ (z, t) = Iocoskz sin cat, (15.64) 

where k = n/l. This function has the correct behavior at z = 0 and at z = ±1/2. 
Also it is harmonic in time, and all points in the antenna oscillate in phase. The 
function /(z, t) is shown in Fig. 15.8(b). The parameter k is n/l, so that 1=0 
at z = ±i/2. For a half-wave antenna the frequency must match the length of 
the antenna. The length i is A/2, so co = 2nc/X = tic /l. (It also follows that 
k = oo/c.) For example, if the antenna length is 10 m, then the frequency must be 
/ = c/(2l) = 15MFlz. The condition i = A/2 is a resonance condition. If the 
antenna length and the wave length satisfy this relation, the antenna is an efficient 
radiator. 

We shall calculate only the asymptotic fields, i.e., terms of order 1 / r forr i. 
The exact formula for A(x, t) is (15.20). For the one-dimensional current in the 
antenna the current element J(x', t')d?x' must be replaced by zl (z\ t')dz', and 
the integral is over the length of the antenna as illustrated in Fig. 15.9, so 

A(,.» = M r ±±=m dz 'i. <15 . 65) 

J-e/2 4zr 7? 

The distance R from the antenna element dz' to the field point x approaches r in 
the limit r i. The approximation R « r is good enough for the factor l/R 
in the integrand. But for the phase of the current we need a better approximation 
because the antenna length is comparable to the wavelength of the radiation; as 
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FIGURE 15.9 Geometry of the half-wave antenna, x is the field observation point, and 
r = |x|. Asymptotically R « r — z' cos 6, where z! is a point on the antenna. 


z' varies along the length of the antenna the phase of the current varies over a 
significant part of n . By Taylor series expansion 

R = \/r 2 — 2 rz! cos 6 + z' 2 « r — z cosd. 


With these approximations the asymptotic potential is 

r C/2 


un r ,z 

A(x, ~ 4 ^: J / (z',t - (r —z' cos 6)/c ) dz z 
rU 2 

/ cos kz' sin (cut — + kz'cosO) dz 1 z. (15.66) 

d-f/2 


-e/2 

Mo A) r€/2 
4rrr 


The calculation of the z' integral is an interesting exercise in calculus. First, 
change the variable of integration from z! to \j/ = kz' ■ The endpoints of r/r 
are ±7 t/ 2. Then 


Arad( x > 0 


MoA) 

4ltkr 



COS r/r sin(r/ry — q)d\j/ 


z, 


where we have introduced the symbols y = cos ft and q = kr — cot. The integra¬ 
tion range can be separated into two parts, (—7 t/ 2, 0) and (0, tt/ 2); then, with a 
change of variable, the i/r integral becomes 


r r/2 C7T/2 

I cos i/r [sin(i/ry — r/) — sin(i/ry + r/)] d\jt = —2 sinr/ / cosi/rcos(i \ry)dc\r 

Jo Jo 


= — 2 sing 


COS(7T//2) 
1-K 2 " 
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Substituting back y = cos 9 we have 

-un/o cos COS0] 

A ra d(x, t) = ——---- sin(&r - cot) z. (15.67) 

2nkr sin z 9 

Next we determine the fields. The magnetic field is obtained from the curl, 
Brad = V X A ra d = VAj. X Z. 


However, because we are only calculating the asymptotic field, i.e., the radiation, 
we must neglect terms of order 1/r 2 . So, for example, V acting on the factor 
1/r gives terms that are negligible in the radiation zone. Also, V acting on the 9 
dependence is down by a power of r, and so is negligible. The only contribution 
to the field of order 1/r comes from V acting on s'm(kr — cot). Thus 


Brad 


—/u.o/o cos [f cos6>] 
2nkr sin 2 9 


k cos (kr — cot) r x z 


^q/q cos [| cos6>] 
2nr sin# 


cos (kr — cot)4>. 


(15.68) 


The magnetic field is azimuthal. This direction makes sense because the current is 
along the z axis and B curls around the current. The electric field in the radiation 
zone is obtained in the manner of previous examples from the requirement that the 
electromagnetic wave is transverse, with B = E / c; therefore E ra d = cB ra d x f = 
cBrad^- 

The energy flux of the electromagnetic waves radiated by the half-wave an¬ 
tenna is the Poynting vector 


S = 


4n 2 r 


coscos 9) 
sin 9 


cos 2 (kr 


■ cot)r. 


(15.69) 


Figure 15.10 shows the angular distribution of the radiated power, in the form 
of a polar plot. The radiation pattern is qualitatively like a Hertzian dipole, but 
somewhat more focused in the xy plane. 

The average power through a solid angle dQ is S avg • rr 2 dO., and the average 
total power, i.e., integrated over all solid angles, is 


avg 


B-oIqC C* COS 2 COS 9 ) 

- / - d9 

4: r J o sinf? 


(15.70) 


The 9 integral cannot be reduced to elementary functions but the definite inte¬ 
gral can be evaluated numerically; 10 its value is approximately 1.22. The antenna 


to 


'See Exercise 12. 



15.3 The Half-Wave Linear Antenna 


583 


z 



FIGURE 15.10 Angular distribution of the radiation intensity from a half-wave dipole 
antenna, expressed as a polar plot. The dipole moment oscillates in the ±z direction. On the 
graph, the distance from the origin to a point on the curve is proportional to the intensity 
in that direction. 


resistance /? am is defined as P- dVg /1^ s , where /^ ls is /q/2, so 

flam= ^,/^ = 73f2. (15.71) 

2 tt V eo 

Exercise 12 explores the strength of radiation from a half-wave antenna. 


Radio and Cellular Phones 

The radiation and antenna requirements of cellular phones are relatively straight¬ 
forward, although the associated engineering problems, which must be solved to 
make the system operate properly, are very challenging. Cellular phones operate 
in reserved ultrahigh frequency bands near 800 MHz and near 1900 MHz. These 
bands are divided into channels. Each channel has a specific frequency and an as¬ 
sociated frequency width within which the signal transfer between cellular phones 
actually takes place. Originally the signals were frequency modulated (FM), and 
a typical 25 MHz wide band accommodates about 400 channels; but digital mod¬ 
ulation, which is a more complex but more efficient way to send information, is 
now often used. 

A large geographical area is divided into cells, each with its own antenna and 
frequency channels. An important feature of cellular telephony is that the radia¬ 
tion intensity must be low enough so that signals from one cell do not interfere 
with those in other cells; this also implies that all receivers must be sensitive. To 
help avoid interference, adjacent cells use different channels. Each cell uses one- 
seventh of the available channels. Channels are reused by being assigned to many, 
but nonadjacent, cells; every seventh cell has the same channels. Figure 15.11 
shows an idealized arrangement of hexagonal cells, in which cells with the same 
letter are assigned the same channels. Some important technical difficulties are 
assigning channels, maintaining contact between cellular telephones—sometimes 
a continent apart—as they move from cell to cell, and, of course, billing. 
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FIGURE 15.11 Idealized arrangement of cellular phone cells. Cells with the same num¬ 
ber are assigned the same channels. 


15.4 ■ THE LARMOR FORMULA: RADIATION FROM A POINT CHARGE 

Electric charge is a property of electrons and protons. Any radiating current is a 
collection of moving charged particles. Therefore a basic problem in electrody¬ 
namics is to determine the radiation from a single charged particle. 

The charge and current densities associated with a moving charge q are 

p(x,t)=qS 3 (x-x q (t)) (15.72) 

J(x, t) = qy(t)S 3 (x - x q (t)) (15.73) 

where x q (t) is the position of q as a function of time, and v = dx q /dt is its 
velocity. The delta function implies that p(x, t) and J(x, t) vanish everywhere in 
space except at the position of q . Note that the integral of p (x, t) over a volume 
V is equal to q if the charge is inside V and 0 otherwise. Therefore (15.72) is 
the charge density function of a single point charge at x q (t ). Also, the charge and 
current densities must obey the continuity equation, V • J = —dp/dt. The time 
derivative of p(x, t) is, by the chain rule of differentiation, 

^=q(-y(t))-VS 3 (x-x q ). 

It might seem very daring to differentiate such a singular function as the delta 
function, but the derivative is well defined if the delta function is regarded as the 
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limit of a sharply peaked distribution. The divergence of J(x, t) is 

V • J = <?v • V<5 3 (x - Xq) 


so indeed the continuity equation is satisfied. 

Given the trajectory of q, the potentials associated with q may be calculated by 
evaluating the retarded potentials (15.19) and (15.20) for the densities in (15.72) 
and (15.73). In a later section (Sec. 15.6) we shall investigate the exact fields of q. 
But in this section our goal is simpler: To determine only the radiation fields, i.e., 
the asymptotic fields propagating away from q as a wave. Furthermore, we will 
assume that the motion of q is nonrelativistic, |v| c. 

The radiation fields can be deduced from (15.35) and (15.39), which are the 
asymptotic fields for a small charge and current distribution. For the case of an 
isolated moving charge q. for which the current density is (15.73), the dipole mo¬ 
ment is p(?) = qXq (t), and its derivative is dp/dt = q\. Therefore in (15.35) and 
(15.39), d 2 p/dt 2 is qa, where a = dv/dt is the acceleration of q. So, applying 
the general formulas to the special case of an accelerating particle, the radiation 
fields are 

„ -a 0 q R x a ,, 

Brad x, t ) = -pz- —— , 15.74 

4nc R 

Erad(x, t ) = cBrad X R. (15.75) 

Here R is the vector from the position of q at the retarded time t r to the field 
point x; also, R = |R| and R = R/R. As usual the radiation fields are inversely 
proportional to the distance from the source. The retarded time t r is determined 
by an implicit equation. 


t r = t - \x-x q (t r )\/c. 


(15.76) 


Also, in (15.74) and (15.75) the acceleration a is evaluated at the retarded time, 
because in the general formulas all mention of the source is at the retarded time. 
The above equations for B ra d and E ra( j are valid if |v| « c, a point that will be 
discussed presently. 

If the charge q is moving with constant velocity (a = 0) then the radiation 
fields are 0. This is necessary from the theory of relativity: In the rest frame there 
are no propagating waves produced by the charge, so in any Lorentz frame there 
can be no propagating waves. 

The Poynting vector in the radiation zone is S = E ra d x B ra d//io- Substituting 
the radiation fields, and simplifying the cross products we obtain 


S = - 

167r 2 eoc 3 /? 2 



(15.77) 


(We have substituted jM) = 1 /(eoc 2 ).) The energy flux direction is radially away 
from the position x q (t r ) of q at the retarded time. The total power of electromag- 
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netic radiation passing through a sphere of radius R at time t, the center of the 
sphere being at the position x q (t r ), is 

P = <£ S • RR 2 dQ. (15.78) 


This P is the instantaneous power that was radiated by q at the retarded time t r . 
The integral over solid angles is computed most conveniently by setting up a co¬ 
ordinate system with the z axis in the direction of a; then a 2 — (a • R ) 2 is a 2 sin 2 6, 
and the relevant solid-angle integral is / sin 2 6 dSl = 8jt/ 3. Our final result, then, 
for the instantaneous power radiated by a nonrelativistic point charge is 


1 2 q 2 a 2 

An€Q 3 c 3 


(15.79) 


This celebrated formula was derived by Larmor in 1897. 

The Larmor formula is valid for nonrelativistic motion of q\ that is, in the limit 
u c. Where did this condition enter the derivation of (15.79)? The general 
formulas (15.35) and (15.39) for the asymptotic fields of a localized distribution 
of charge and current were originally derived for a pointlike source, so that the 
retarded time may be approximated as a constant throughout the source. More 
precisely, the condition for validity of those equations is that £ <<C c r, where i is 
the size of the source and r is the time constant of the variation—a characteristic 
time over which the charge density changes significantly. For example, for an 
oscillating source r is the period of oscillation and (15.35) is valid iff «A. In 
the case of a radiating particle the time Z/v plays the role of the characteristic 
time r, so the condition for (15.35) and (15.39) to be accurate is v <SC c. 

The Larmor formula emphasizes that acceleration of a charged particle pro¬ 
duces radiation. The accelerating particle must either lose energy, or else an ex¬ 
ternal agent must do work on the particle to maintain the particle energy. The next 
example shows that a classical model of atomic structure, based on planetary or¬ 
bits of electrons, is not physically reasonable because of energy loss by radiation. 


EXAMPLE 3 Instability of the classical atom. A classical, “planetary” model of 
a hydrogen atom has an electron in a circular orbit of radius r around the nucleus. 
Neglecting radiation, the equation of motion is 


2 2 
m v e 

r Aiteor 2 


and the particle energy is 


E = -mv 
2 


—e 


4treot 8jtcor 


(15.80) 


(15.81) 



15.4 The Larmor Formula: Radiation from a Point Charge 


587 



FIGURE 15.12 Example 3. Instability of the classical atom. The circular orbit is un¬ 
stable because the electron radiates. As it loses energy, the electron spirals into the nucleus. 


Now take into account the radiation. The electron loses energy, and spirals into 
the nucleus as illustrated in Fig. 15.12. How long does it take for the electron to 
hit the nucleus? 

The electron loses energy slowly enough that the orbitremains almost circular, 
and we may continue to use (15.80) and (15.81) for the electron orbit as the elec¬ 
tron spirals in. 11 By conservation of energy the electron must lose orbital energy 
at the same rate that it radiates electromagnetic waves; so, since the latter rate is 
the Larmor power, 


dE —1 2 e 2 a 2 

— = —P = ——5— 

dt 4;reo dr* 


(15.82) 


where E is the particle energy (15.81). The acceleration in circular motion is 
a = v 2 /r, and we can eliminate v 2 using (15.80). This leads to an equation for 
the decreasing radius r(t), 


dr 4 cr 2 

dt 3 r 2 


(15.83) 


We have introduced the constant r e , called the classical radius of the electron, 
defined by 


r e 


-=- = 2.8 x 10 -15 m. 

4jreo me 2 


(15.84) 


It is left as an exercise 12 to show that the solution of this differential equation is 

"See Exercise 13. 

12 See Exercise 14. 
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r(t) = (r 0 3 -4cr e 2 r) 1/3 , (15.85) 

where ro is the orbit radius at t =0. The electron hits the nucleus at time t = 
r^/(4cr 2 ). This time is of order 10 _11 s. 

The instability of the classical atom is an example of the breakdown of classical 
physics at the atomic scale. What makes the atom stable in reality, i.e., in the 
quantum world, is that there is a lower bound on the electron orbital energy, of 
— 13.6eV in a hydrogen atom. The electron cannot radiate from this state because 
it has no lower energy state into which it can go. 


EXAMPLE 4 An a particle with energy 1 Mev collides head-on with a uranium 
nucleus. How much of the initial kinetic energy is converted to energy of radiated 
electromagnetic waves? 

The energy radiated is small, so a good approximation is to apply the Larmor 
formula to the trajectory that is calculated ignoring radiation. The total radiated 
energy is the integral of P over the entire (one-dimensional) trajectory. Let the 
U nucleus (Q = Ze with Z = 92) be at rest at the origin and the a (q = 2e) 
travel along the x axis. The a starts at x = oo with v = —Voo, decelerates on 
its incoming path from oo to xo, the distance of closest approach, then reverses 
direction and accelerates on its outgoing path to x = oo, where v = Uqo. For 
simplicity we’ll neglect the recoil of the U nucleus, which is reasonable because 
the mass is much greater than the a mass m. 

First we must determine Uqo and xo- It is specified that mvl^/2 = 1 Mev; the 
numerical value is r !0O = 6.9 x 10 6 m/s, or = 0.023c. At closest approach 
the kinetic energy is 0 and the total energy is the Coulomb potential energy; thus 
mv£. j/2 = q Q/(4jze()XQ), which may be solved for xo- 

The energy radiated on the inward trajectory equals the energy radiated on the 
outward trajectory, by symmetry, so, using the Larmor formula, the total energy 
radiated is 


f?rad — 2 


P(t)dt = 


a 2 (t)dt. 


(15.86) 


-/outward 3 tt €()C J outward 

Or, changing the variable of integration from t to x, using dt = dx/v(x), we have 


roo „2 


u rad — 


3;reo c3 


'■*0 


a z (x) 

v(x) 


dx. 


(15.87) 


The acceleration is given by Newton’s second law, a = qQ/(4n6omx 2 ). The 
velocity may be obtained from conservation of energy, mv^/2 = mv 2 /2 + 
qQ!(4: reox); after some algebra, 


V(x) = Uoo(l - X 0 /x) 1/2 . 


( 15 . 88 ) 
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Substituting these expressions into the integral gives 


C/rad — 


q A Q 2 


48jr 3 eQC 3 m 2 r 00 


^r- 

Voo Jx n 


dx 


4 (1 — Xq / x ) 1 / 2 ’ 


(15.89) 


For evaluation we reduce the integral to nondimensional form by changing the 
variable to £ = xq/x. After some final simplifications the result is 


^rad 


1 

12Z 


mvz 


' v oo 

' C 


u; 


rd$ 


(l 


(15.90) 


where Z = 92. The integral may be determined numerically as 1.07. Thus f/ ra d = 
1.2 x 10~ 8 MeV. Only a very small fraction of the initial kinetic energy is radiated 
away. 


The transition between classical theory and quantum electrodynamics is not 
clear cut, and there are essentially microscopic phenomena to which classical 
models apply surprisingly well. Example 4 is such a case, and others are treated 
by Jackson. 13 - 14 


15.5 ■ CLASSICAL ELECTRON THEORY OF LIGHT SCATTERING 

When an electromagnetic wave impinges upon a system of charges, they acceler¬ 
ate in response to the electric and magnetic forces, and radiate. Energy from the 
incident wave is transferred to an outgoing spherical wave centered at the radi¬ 
ating charges. This phenomenon is a kind of scattering because light intensity is 
taken from the incident direction and transferred to other directions. For example, 
if a pulse of light traveling in the z direction hits a hydrogen atom at the origin, 
there will be an outgoing spherical pulse of light—the scattered light—with some 
angular distribution of intensity. In the classical picture energy is transferred from 
the incident wave to the electron, and simultaneously from the electron to the 
scattered wave. (Radiation by the proton is negligible, as we’ll see later in the 
discussion.) The goal of a theory of light scattering is to calculate the scattering 
cross section, defined below. Although quantum theory is necessary for a com¬ 
plete description of the interaction between photons and atoms, some interesting 
aspects of this physics can be explored using the classical theory. The classical 
theory is an important first step toward understanding the scattering of light by 
electrons, atoms, or molecules. 

As a simple model, we shall describe an atomic electron as a point charge 
—e bound to the origin by a Hooke’s law force. We used this classical electron 

13 J. D. Jackson, Classical Electrodynamics, Chapter 15. 

14 Another example is found inD. R. Stump andG. L. Pollack, “Radiation from a neutron in a magnetic 
field,” Eur. J. Phys. 19, 59 (1998). 
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eEoco 2 m (wq — w 2 ) 
m 2 (w^ — w 2 ) 2 + y 2 w 2 

(15.94) 

c eEoyco 3 

m 2 (w^ — w 2 )" + y 2 w 2 

(15.95) 


The Lannor power is proportional to a 2 . To calculate the average power , averaged 
over a period of oscillation, note that the average of cos 2 cot or sin 2 cot is 5 , while 
that of cos cot sin cot is 0. The average power radiated by the electron is 


_ e 2 (C\ + C 2 ) _ e 4 E 2 co 4 

avg 127reoc 3 127reoc 3 [w 2 (wq — w 2 ) 2 + y 2 w 2 ] 


(15.96) 


Note that the radiated power is proportional to Eq, the reason is that the elec¬ 
tron is accelerated by the electric field of the incident light, so a oc Eq. Therefore 
the power is proportional to the incident intensity. The scattering cross section o 
is defined as the ratio of the radiated power to the incident power per unit area, 
i.e., to the incident intensity; 


a — 



(15.97) 


where Sj nc is the average of the Poynting vector of the incident wave. The cross 
section has units of area, and is a measure of the scattering efficiency of the atom. 
For the incident plane wave Sj nc is (qcE^/2. Therefore the cross section for light 
scattering by the bound electron in this classical theory is 


8 nr 2 co 4 

3 (wj - w 2 ) 2 + (yw/m ) 2 


(15.98) 


Again r e is the parameter e 2 /(Aneomc 2 ), called the classical radius of the elec¬ 
tron, defined earlier in (15.84). 

Figure 15.14 is a graph of a versus co. In this graph y has been set rather 
arbitrarily to 0.1 mcoo, for illustration purposes. There are three distinct regions: 
co wo, co ~ wo, and w wo. At low frequencies, w <JC wo, the cross section 
approaches the Rayleigh cross section 


^Rayleigh — 



(15.99) 


In this limit the cross section varies as co 4 , or A 4 where X is the wavelength. Long 
waves are scattered less than short waves, and the dependence on X is strong. 
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cr 



FIGURE 15.14 The cross section a (co) f or scattering of light by a bound electron, cal¬ 
culated from the classical theory, as a function of the angular frequency co of the light, coq 
is the natural frequency */K Jm. 


Rayleigh scattering explains why the sky is blue. The sky is a big light, pro¬ 
duced by scattering of sunlight from atmospheric molecules, predominantly N 2 
and O 2 . A characteristic natural frequency coo for the electrons in an atom or 
molecule might be in the ultraviolet, so the Rayleigh cross section applies to scat¬ 
tering of visible light. The incident sunlight is white—a uniform superposition 
of visible wavelengths. But the sky is more intense at short wavelengths (violet 
and blue) than long (orange and red) because the cross section increases with 
frequency; the sky is blue. In contrast, clouds are white. A cloud consists of liq¬ 
uid droplets of water, which are much larger than a wavelength of light. A water 
droplet scatters light as a macroscopic dielectric, i.e., by reflection and refraction 
at the surface, rather than by radiation by a single electron. This scattering is ap¬ 
proximately independent of frequency so the scattered light from a cloud is white, 
like the incident sunlight. 

Why is URayieigh oc co 4 ? It is because for a harmonic oscillation the acceleration 
goes as co 2 , and the Larmor power goes as the acceleration squared. 

Scattering for co near coo is called resonant scattering. The cross section is 
sharply peaked near coo, because the damping is small, y/m <?C coq. When the 
electron is driven near its natural frequency, the motion and acceleration are large, 
leading to a large cross section. This case is analogous to resonance fluorescence 
in atoms, or to the absorption line spectrum: Incident light is absorbed if the fre¬ 
quency is near a spectral frequency. 

For high frequencies, co J>> coo, the electron behaves as if it were a free par¬ 
ticle : coo and y/m can be neglected compared to co, so the atomic forces in 
(15.91) are negligible. Then the cross section approaches the Thomson cross sec¬ 
tion 


8 *r 2 


= 6.6 x 10 -29 m 2 , 


^Thomson — 


3 


(15.100) 
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which is the cross section for scattering of light, or photons, from a free electron. 
This cross section is independent of oj. Thomson scattering is observed from 
plasmas. For example, in experimental plasma physics Thomson scattering is 
used as a diagnostic tool for plasma production. Or, the corona of the sun, seen 
during a total eclipse as a large bright region around the sun, is sunlight scattered 
from the tenuous plasma around the sun, which extends to distances comparable 
to the solar radius. 

For very high frequencies, hco ~ me 2 or larger, the classical theory breaks 
down. This range of hard X-ray scattering is called Compton scattering. For high 
frequency, i.e., high photon energy, the scattered light has a different frequency 
from the incident light. The frequency shift depends on the scattering angle in a 
way that can be calculated in the photon theory of light. The existence of a fre¬ 
quency shift shows that the classical theory cannot explain the phenomenon. In 
the classical theory the scattered light is light radiated by the accelerating electron, 
so it has the same frequency as the electron oscillation; and because the electron 
oscillation is driven by the incident light, that is the same frequency as the in¬ 
cident light. Compton’s experiments verified the photon theory of light, because 
the frequency shift measured in the experiment agrees with that predicted by the 
photon theory. 

In quantum theory, Compton scattering is the process y\ + e\ —> Y 2 + e 2 - In 
a frame where the initial electron e\ is at rest, the outgoing photon Y 2 has less 
energy than the incoming photon y \, because of the recoil of the electron. Thus 
Y 2 has a lower frequency than y \. For fuo me 2 the frequency shift is small, and 
the classical theory gives the correct cross section. But for ha> of order me 2 or 
larger the scattering process can only be described by quantum electrodynamics. 


15.6 ■ COMPLETE FIELDS OF A POINT CHARGE: 
THE LIENARD-WIECHERT POTENTIALS 


We end this book with a very basic problem—to determine the complete electro¬ 
magnetic field of a charged particle in arbitrary motion. If the particle is at rest 
the field is just the electrostatic Coulomb field. If the particle moves with constant 
velocity, the field can be calculated by the Lorentz transformation of the static 
field. But what if the particle moves on a trajectory x q (t), with accelerations and 
decelerations, and changes of direction? What is the electromagnetic field then? 

For any charge density, the scalar potential in the Lorentz gauge is 


V (x, t) 


e 0 J 4:tR 


(15.101) 


where R = |x — x'|; this is (15.19). For a point charge q moving on the trajectory 
x q (?) the charge density is 


p(x',?') = qS 3 {x' - Xq(t')) 


(15.102) 
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as we saw before in (15.72). The 6-function requires in (15.101) 

x' = x q (t — |x — x'|/c). (15.103) 

That is, x' is a point on the trajectory at a certain time t r such that 


x ? (*r) = x 9 (r - |x - x 9 (r r )|/c). (15.104) 

The time t r must be determined by the implicit equation 

t T = t- \x- x q (t v )\/c. (15.105) 


Or, in words, t r is the retarded time for the observation point (x, t ) if the particle 
moves on the trajectory x q (t). Then the potential is 


V (x, t) = 


‘XneoRt 


/ 6 3 [x - x q (t 


k'I /c)\ d 3 


(15.106) 


where R q = |x — x q (t r )\. What makes this problem difficult is that the retarded 
time t r depends on the observation point x. 

The integral in (15.106) can be evaluated, because the 6-function picks out one 
point in space, namely x' = x q (t T ). But the evaluation is nontrivial because of the 
complicated way that the 6-function depends on x'. It is tempting, but wrong, to 
say / 6 3 (x 7 — xo )d 3 x' = 1; that evaluation is correct if xo is a constant, but not so 
if xo depends on x'. Before figuring out the 3-dimensional integral, let’s practice 
on the following 1-dimensional integral, 


h 



f(x')\ dx', 


(15.107) 


where f(x') is some well-behaved function. The only way to evaluate this integral 
is to change the variable of integration from x! to x" = x! — fix'). Note that 

dx" = [l - f\x')]dx' (15.108) 


the prime on / meaning the derivative with respect to its argument; thus 


f 00 8(x")dx" 

h ~ J-oo IWVjj 


1 

U -/'(xo)l' 


(15.109) 


In the second equality we have used the defining property of the 6-function: 8(x") 
is an infinitely sharp spike at x" = 0, so / 8{x")g{x")dx" = g(0). In (15.109)xo 
is the value of x', where x" = 0; that is, xo = /(xo). The key to the evaluation of 
1\ is to change the variable of integration to the argument of the 6-function. 

Now consider the 3-dimensional integral 

h = f S 3 [x' -x q (t- \x- x'|/c)] d 3 x', 


(15.110) 
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which appears in (15.106). The key to evaluating I 3 is to change the variable of 
integration to x", the argument of the 6-function, 


x" = x' — x q (f — |x — x'| /c). 


(15.111) 


h = I S 3 (x") 


d 3 x" _ 1 

J(x") “ J( 0) ; 


here J is the Jacobian of the change of variables, 


(15.112) 


/ dx'/ 

J(x)=Det (4 


(15.113) 


By the chain rule of differentiation, the transformation matrix for the change of 
variables is 


dxgi d (t - |x - x'l/c) 


Vi (x -x')j 


C X — X' 


(15.114) 


The Jacobian is to be evaluated at x" = 0, i.e., at x' = x (/ (t r ). Therefore in 
(15.114), x — x' is x — Xq (? r ) = R^. Combining these results we have 


J (0) = Det [Sij - Pinj] 


(15.115) 


where we define 


f3 = \/c and n — R q /R q . 


(15.116) 


The unit vector n points in the direction from the position of q at the retarded 
time to the observation point x. Also, (3 is the velocity in units of c. Finally, a 
short calculation 16 reveals the value of the Jacobian to be 


J( 0) = 1 - fi • A 


(15.117) 


Thus the scalar potential of the point charge q is 


V(x, t) 


Ane^Rq 1 — n • f3 


(15.118) 


See Exercise 20. 
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To find the vector potential, consider (15.20) with J(x', t') = p(x', where 

t' is the retarded time (15.18); thus 


A(x, t) 




q\ 


4jt£qc 2 R (/ 1 — n 


(15.119) 


Again, = x — x q = R q n is the vector from q at the retarded time to the 
observation point; and /3 = v/c is the velocity of q in units of c. In all expressions 
the source variables ( x q and v) are evaluated at the retarded time t r defined in 
(15.105). The potentials (15.118) and (15.119) are called the Lienard-Wiechert 
potentials. 


15.6.1 ■ A Charge with Constant Velocity 

An important special case is for constant velocity v = v<). We can calculate the 
potential and fields for this case in another way, by making the Lorentz trans¬ 
formation from the rest frame of q (the frame T' in which the potentials are 
V' = q/(4jieor') and A' = 0) to the observation frame in which q has velocity 
vo- This method only works for constant velocity. On the other hand, the Lienard- 
Wiechert potentials are general, so they must give the same answer for this special 
case, as we shall now verify. 

For convenience set up a coordinate system with the z axis in the direction of 
motion, so vo = uok; and let the origin be the position of q at t = 0. Then the 
trajectory is \ q ( t ) = uofk. The distance between q at the observation time t and 
the observation point x at time t is r = sjx 2 + y 2 + (z — not) 2 - The geometry is 
shown in Fig. 15.15. 

The scalar potential V(x, t) in (15.118) is expressed in terms of the position of 
q at the retarded time t r . To compare with the potential calculated by the Lorentz 
transformation, we must rewrite V (x, t) in terms of the position of q at the ob¬ 
servation time t. Refer to Fig. 15.15. P is the observation point at (x , y, z). Q is 
the position of q at the observation time, (0, 0, uof); and Q r is the position of q 
at the retarded time, (0, 0, i>of r ). The distance QQ r is vo(t — t r ) = fiR q , where 
jS = vo/c. The distance QN is z — vot, and the distance NP is yjx 2 + y 2 . Now 
here is the crucial step in the analysis: The distance Q r M can be written as 


Q r M = fiR q cos 9 = R q h ■ /3; 


Therefore the quantity R q (1 — n • /3), which appears in the Lienard-Wiechert 
potentials, is 


/^(l — n • f3) = the distance MP. (15.120) 

But the distance MP can be determined using the right triangles QPM, QPN, and 
Q r PN: 
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z 



FIGURE 15.15 Charged particle with constant velocity. The charge moves on the z 
axis with velocity uok. P is the observation point at (x, y, z). Q is the position of q at the 
observation time, (0, 0, not); and Q r is the position of q at the retarded time, (0, 0, VQt r ). 
Rq is the distance Q r P. 


MP = Jr 2 - (PRq sin6>) 2 

= \/x 2 +y 2 + (z- not) 2 - p 2 (x 2 + y 2 ) 

= ~Jx 2 + y 2 + y 2 (z - vot) 2 (15.121) 

Y V 


where y = 1 /y'T — ft 2 is the familiar y parameter in the Lorentz transformation. 
Substituting this result for MP into (15.118) we find 


V(x, t) = 


4 ™0 [jr 2 + y 2 + y 2 (z — vo f) 2 ] 


1/2 ■ 


(15.122) 


Similarly, 


A(x, t) = 


yv ok 


47T60C 2 ^2 -I- -y2 _|_ y 2( z _ not) 2 ] 


1/2 ■ 


(15.123) 


The same potentials result from making the Lorentz transformation from the rest 
frame of q to the observation frame, because A IJ - = (V/c, A) is a Lorentz 4- 
vector. The corresponding fields E(x, t) and B(x, t) are the same fields of a par- 
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15.6.2 


tide with constant velocity that we calculated by the Lorentz transformation in 
Sec. 12.5. 

A charged particle moving with constant velocity does not radiate electromag¬ 
netic waves. The Poynting vector can be analyzed to show that the field energy is 
carried along with the particle. 17 


The Complete Fields 

Now we return to the general case. We have the potentials (15.118) and (15.119). 
To calculate the fields we only need to evaluate some derivatives 

E = —VV — 3A/3r and B = V x A. (15.124) 


However, calculating these derivatives is fiendishly difficult because the retarded 
time t r depends on x and t. In other words, V may act on the time dependence of 
R 9 = x — x q (f r ) or /3(f r ). For example, consider the quantity V? r . From (15.105) 


thus 


Vf r 


1 

-VR, 

c 


<7 


1 (Xj ~X qi ) 
C Ix-x^l 


V (*/ - Xgi) 


= --[n-n-vVr r ]; 




c(l-h-P) 


and VR ’ = rrira' 


(15.125) 


Also, even more complicated, V may act on n = R q /R q . 

The result of the very intricate calculations of the derivatives in (15.124) is, for 
the electric field. 


E(x t) = £ (n-/3)(l~^ 2 ) | q ii x [(n - 0) x a] 

47Te 0 R% (l — n • /3) 3 4neoc 2 R q (i-fi./3) 3 

(15.126) 

wherea = d\/dt is the acceleration of the particle; and the result for the magnetic 
field is very simply related. 


B(x, t) = n x E(x, t)/c. (15.127) 

The first term in (15.126) is proportional to R~ 2 , like the field of a static charge. 
If q is static (/3 = 0) then this first term is just the Coulomb field. This term is 
negligible at large R q , and is not part of a propagating wave. The second term in 

17 


See Exercise 21. 
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(15.126) is proportional to R~ l and is nonzero if the charge accelerates. Asymp¬ 
totically this term, together with the associated term in B, is the radiation field, a 
wave propagating away from the charge. 

For nonrelativistic velocities, fi is much less than 1 and many factors in 
(15.126) simplify. In particular, if yS <gC 1 then the asymptotic fields at large 
distances (r = |x| —»• oo) are 


Erad 


qn x (n x a) 
4neoc 2 r 


for /8 «; 1, 


Brad 


—fioqn x a 
4trcr 


for P « 1. 


(15.128) 

(15.129) 


(Asymptotically, R q ~ r.) These fields agree with the radiation fields of a nonrel¬ 
ativistic charge that we derived in Sec. 15.4, which led to the Larmor formula for 
the radiated power. 


15.6.3 ■ Generalization of the Larmor Formula 

Knowing the exact fields (15.126) and (15.127) we can now calculate the power 
radiated by an accelerating charge moving with arbitrary velocity. For nonrel¬ 
ativistic motion the power is given by the Larmor formula (15.79). The exact 
formula is a generalization of that result. 

The calculations that must be done are first to find the asymptotic fields for 
large r = |x|, and then to integrate the Poynting vector over a sphere in the 
radiation zone. Rather than go through the details of these long calculations, it 
will be more interesting to consider one special case, and then state and analyze 
the general result. 


Acceleration Parallel to Velocity 

As a special case, suppose a and v are in the same direction; this is the case if q 
moves along a straight line. Then /3 x a = 0 and the asymptotic electric field is 


„ n x (n x a) 

i^rad — , -1 -i 

47reo c L r k j 


(15.130) 


where 


k = 1 — n • /3. 


(15.131) 


The asymptotic magnetic field B rac j is n x E ra d/c- The Poynting vector is 


S — Erad * ®rad — 
Mo 



(15.132) 


Note thatthe energy flux is directed away from the retarded position of the charge. 
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Now, from (15.130) 


4d = 


( q V 

\4^e 0 c 2 r/ 


(a 2 — (n • a) 2 ) 


(15.133) 


Thus the energy that passes through a sphere of radius r during a small time 
interval dt is 


dU = 


—gl— [ 


a 2 — (n • a) 2 


d£l dt. 


(15.134) 


The energy in (15.134) was emitted by the particle during the time interval dt r , 
where t r is the retarded time. Equation (15.105), which relates the times t and t r , 
implies 

dt r = dt ---— • d lx — Xo(r r )l = dt + n • /3dtp, (15.135) 

c |x - \ q \ L 


or, rearranging, 


dt = (1 — n ■ (3)dt r = Kdt r . 


(15.136) 


Therefore the particle’s rate of energy emission, per steradian of the radiation 
direction, is 


d Trad 
dQ 


d 2 U 


a 2 — (n • a) 2 


dQdt r 16;r 2 eoc 3 k 5 


(15.137) 


To analyze the angular distribution, let 6 be the angle between n and the veloc¬ 
ity or acceleration. For example, set up a coordinate system with q at the origin 
and the velocity in the z direction; 6 is the polar angle of the field point. Then the 
differential power as a function of 0 is 


d Trad 

da 


1 


2 2 

q a 


sin 2 6 


47TC0 4jtc 3 (1 -J0COS0) 5 ’ 


(15.138) 


Figure 15.16 shows the angular distribution of the intensity in the form of a po¬ 
lar plot, for = 0.1, p = 0.3, and (i = 0.5. For small (i the distribution is 
approximately sin 2 6 . which is symmetric for 0 < tt/ 2 and 9 > tt/2. As f) ap¬ 
proaches 1 the distribution becomes peaked forward, with maximum intensity in 
a small angular ring around the path of the particle. 

The total power of electromagnetic radiation from the particle is 


T*rad — 


r dP* ad 

o d£l 


2 tr sin OdO = 


1 2 q l a 


2„2 


1 


47TC0 3c 3 




3 • 


(15.139) 


For /S <SC 1 the result approaches the Farmor formula. For fi approaching 1, the 
power for a given acceleration is much larger than Farmor’s formula. 
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FIGURE 15.16 Polar plot of the intensity of radiation from a particle with accelera¬ 
tion and velocity in the same direction. The particle moves along the z axis. The power 
distribution is shown for jS = 0.1, 0.3 and 0.5. 


Lienard's Formula 

The results of the previous section were for a special case, in which the acceler¬ 
ation and velocity of the particle are in the same direction. The general result for 
the total power radiated by a charged particle is 18 


1 2 q 2 a 2 (1 — /3 2 sin 2 1 /r) 
md 4 jt6o 3c 3 (1 — /8 2 ) 3 


(15.140) 


Here xj/ is the angle between the velocity and acceleration. This result was first 
derived by Lienard in 1898. 


EXAMPLE 5 Synchrotron radiation. For circular motion the acceleration is 
centripetal, orthogonal to the velocity, so xj/ = n /2. The acceleration is a = v 2 /R, 
where R is the radius of the orbit, so by Lienard’s formula the power is 


1 2 q 2 v 4 

rad 47reo 3c 3 /? 2 (1 — ft 2 ) 2 


(15.141) 


For example, consider an electron (q = — e) circulating in a synchrotron of 
radius R and magnetic field B . The relativistic equation of motion is 


= = eBR. (15.142) 

v 2 /c 2 

18 To derive (15.140) by integrating the Poynting vector is quite a challenge. (Using an algebraic 
manipulation program is a big help.) It can also be derived by Lorentz transformation of the Larmor 
formula; see Heald and Marion, Sec. 14.9. 
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In the high-energy limit we have v c and 

1 

\ — P 2 \ me ) 

Thus the radiated power approaches 

1 2 e 4 B 2 y 2 

rad 47reo 3m 2 c 

where y = 1/^1 — p 2 . Numerically, 

/ D \ 2 

Pnd = (9.9 x 10 4 eV/s) I — j y 2 (high energy limit). (15.145) 

An electron in a synchrotron loses a lot of energy by radiation. 19 Therefore the 
next high-energy electron accelerator, which will be built sometime in the future 
to study elementary particles and their interactions, will be a linear accelerator 
rather than a synchrotron. Proton accelerators have no problem with synchrotron 
radiation at current energies. The power of synchrotron radiation is proportional 
to y 2 /m 2 , so the power radiated by a proton is less than 10 11 times that from an 
electron of the same energy. 


(15.143) 


(15.144) 


Applications of synchrotron radiation. Synchrotron radiation is not just an un¬ 
wanted by-product of high-energy accelerators. Many electron synchrotrons— 
over 20 worldwide with more planned—are in use as high-intensity light sources 
for scientific and engineering studies. The electrons travel in large circular orbits 
at ultra-relativistic speeds, such as 1 — v/c <5x 1CT 9 . The radiation is strongly 
concentrated in the direction of motion, like a searchlight beam. The radiation 
extends over a broad range of frequencies, and is intense, collimated, and po¬ 
larized. With monochromators, radiation studies can be carried out from 0.01 eV 
(far infrared) to above 1 GeV. Synchrotron radiation is used for research on atoms, 
molecules, solids, X-ray diffraction studies of protein structure, and X-ray lithog¬ 
raphy. 

The path followed by a charged particle in a synchrotron is controlled by mag¬ 
netic fields. In addition to the main magnets that determine the overall circular tra¬ 
jectory, other magnets, called wiggler and undulator magnets, cause short-range 
accelerations of the particles. The purpose of these magnets is to enrich the spec¬ 
trum of emitted radiation so that it can be used to probe efficiently the materials 
or processes studied by scattering of the synchrotron light. 

19 


'See Exercise 23. 
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FURTHER READING 

M. A. Heald and J. B. Marion, Classical Electromagnetic Radiation, 3rd ed. (Saunders, 
Ft. Worth, 1995). This book is a thorough treatment of waves and radiation in classical 
electromagnetism, at the intermediate level. 

W. L. Stutzmann and G. A. Thiele, Antenna Theory and Design (Wiley, New York, 1981). 
This book is intended for students of electrical engineering, and it has many interesting 
examples of the physics of antennas. 

The Feynman Lectures on Physics, Vol. II, Sec. 21. Feynman discusses interesting equa¬ 
tions for the fields of a charged particle with arbitrary motion, called the Heaviside- 
Feynman equations. In the notation of Sec. 15.6.2, these equations are 


q 

n 

+ Rq d \ 


\+~ d — 

47T6Q 

A 

+ —j t \ 

U v 

' c 2 rf/ 2 


and 


B(x, t) = n x E(x, t)/c. 

A derivation and discussion are given in A. R. Janah, T. Padmanabhan, and T. P. Singh, 
Am. J. Phys. 56, 1036 (1988). 


EXERCISES 

Sec. 15.1. The Retarded Potentials 

15.1. Show that (15.28) and (15.29) represent plane waves propagating in the +z and — z 
directions at speed c. Show also that the discontinuity in B at the xy plane satisfies 
the appropriate boundary condition. 

15.2. Suppose that at t = 0 a current I is suddenly established in an infinite wire that lies 
on the z axis. What are the resulting eleclric and magnetic fi elds? Show that after a 
long time, t 3> r/c, the magnetic field is the same as the static field of a long wire 
with constant current I. What is the electric field for t S> r/cl [Answer: 

-MO lc 

2jt \Zc 2 t 2 — r 2 

MO / ct 1 
2 ntr y / c 2 t 2_ r 2 i 

15.3. Starting from Maxwell’s equations with sources p(x, t ) and J(x, t) in vacuum, 
show that 


E z (r,t) = 

B^ir, t) = 


, 1 9 2 E 1 3J 

-V-E+^-y =-Vp-Mo — 

c 2 dt l 60 9f 

, 1 3 2 B 

- vb +^=mvxj. 
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In comparison with (15.5) and (15.6) the left-hand sides of these equations are the 
d’Alembertians of the fields themselves rather than of the potentials. The source 
terms, on the right-hand sides, are more complicated than p/e q and /iqJ. By using 
(15.22), the Green’s function of the d’Alembertian, the equations can be solved for 
E and B as integrals involving the retarded source terms, analogous to (15.19) and 
(15.20). The integral expressions, which are known as Jefimenko’s equations, can 
be used to discuss interesting problems in electrodynamics. 20 


Sec. 15.2. Dipole Radiation 

15.4. Calculate the asymptotic electric field E ra( j produced by a small time-dependent 
electric dipole, by differentiating the vector and scalar potentials in the Lorentz 
gauge. 

15.5. For the Hertzian dipole, i.e., p (t') = po cos cot 1 k, show that the asymptotic poten¬ 
tials are 


A ~ 

V ~ 

Verify the Lorentz gauge condition asymptotically. 

15.6. Figure 15.6 shows the angular distribution of electric dipole radiation as a polar 
plot. One-half of the intensity is emitted in the angular range 

jt a n a 

-< 6 < —I—, 

2 2 ~ ~ 2 2 

i.e., in a sector with angular measure a centered at the xy plane. Determine a, 
which is called the half-intensity angle. 

15.7. (a) From the potentials show that the magnetic dipole radiation fields are 


~M0P0 
4jt r 

-MQCPO 


4jt 


<wsin<w(r — r/c) k 

sin<w(/ — r/c). 


/ cos 6 \ 

{— r sir 


Erad(mag.dipole) = - 


P0 d 2 m 
4ncr dt 2 


x r, 


Brad(mag.dipole) = 


MO 

[ f U d2m \ 

d 2 m 

4jrc 2 r 

iTW 

-"dt 2 


(15.146) 

(15.147) 


(b) Show that the dual transformation (15.55) converts the electric dipole radiation 
fields into the magnetic dipole radiation fields. 

15.8. Show that the full fields of a harmonically oscillating pointlike electric dipole are 
identical to the spherical wave discussed in Section 11.5.4. 

15.9. Show for typical atomic dimensions that mo/c <JC po- 


20 O. D. Jefimenko, Electromagnetic Retardation and Theory of Relativity (Electret Scientific Com¬ 
pany, Star City, W.Va., 1997). 
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Sec. 15.3. Half-Wave Antenna 

15.10. The current 1 (z, t) in the half-wave linear antenna is given by (15.64). Determine 
the linear charge density, i.e., the charge per unit length Xiz, t), in the antenna. 
Show that the charge density is maximum at times when the current is 0. What is the 
maximum charge density at the end of the antenna if / 0 = 10 A and a> = 2n MHz? 

15.11. Calculate the half-intensity angle (see Exercise 6) for the half-wave dipole antenna. 

15.12. (a) Use a computer program to evaluate the 6 integral in (15.70). 

(b) An FM radio station has frequency 10 MHz and power 10 kW from a half-wave 
antenna. How long is the antenna? What is the rms current? 


Sec. 15.4. Larmor Formula 


15.13. Use the results of Example 3 to estimate the number of revolutions made by the 
radiating electron in the classical model of the atom as it spirals into the nucleus 
from initial radius ro = 10 -10 m. 

15.14. Derive the solution (15.85) of the differential equation for the orbit radius as a 
function of time in the classical model of an atom. 


15.15. The average power radiated by an oscillating dipole, with dipole moment pit) = 
p o cos cot, is 


Pq m 

1 iTttQcf 


Derive this result from the Larmor formula, treating the dipole as an oscillating 
pair of charges ±q o, which oscillate 180 degrees out of phase with amplitude of 
oscillation d/2. (Note that p o = (]()d.) But be careful! The waves radiated by qo 
and —<7o interfere, so in Larmor’s formula you must add the qa’s and then square, 
rather than adding the squares of the qa’s! 


15.16. (a) Energy loss by radiation is an insignificant contribution to the resistance of 
an ordinary wire. Consider a conduction electron with velocity uq (moving in 
one dimension) that is brought to rest, because of a collision, over a distance D. 
Derive the ratio / of the energy radiated to the initial energy, assuming constant 
deceleration. 

(b) Let vo be a typical Fermi velocity, say for Cu (m Vp = kTp) and D a typical 
lattice dimension, say 4 x 10 -10 m. Calculate /. 


15.17. Suppose the velocity v and acceleration a of a charged particle are collinear. Let 
dP/dn be the differential power radiated, i.e., the power per steradian. Call the 
total power Ep = f(dP/dQfdQ.. 

Use computer graphics to plot the angular distribution of the power 


1 dP 


as a function of the angle 6 with the direction of motion, for these speeds (super¬ 
imposed): 0.01 c, 0.5 c, and 0.99 c. Use either a polar plot or an ordinary function 
plot. 
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Sec. 15.5. Light Scattering 

15.18. Compute the numerical value of the ratio a (violet)/<r(red), where cr (to) is the 
Rayleigh cross section for scattering, by a bound electron, of light with frequency 
co <K COQ. 

15.19. The electron density n in the solar corona is roughly 10 9 cm~^. Let Idirect he the 
intensity of direct sunlight (at the Earth) and T cor0 na the intensity of coronal light. 
Use dimensional analysis to estimate the ratio Icorona/Tdirect- (Hint: The differen¬ 
tial cross section for Thomson scattering is 


da 


^i + cos 2 .) 


Sec. 15.6. Lienard-Wiechert Potentials 

15.20. Evaluate the Jacobian determinant in (15.115). 

15.21. Calculate the Poynting vector and energy density of the electromagnetic field of a 
charged particle moving with constant velocity. Show that the field energy is carried 
along with the particle. 

15.22. Calculate the integral of d P ra( }/dQ for radiation from a charged particle with ac¬ 
celeration and velocity in the same direction. 

15.23. Calculate the energy radiated per revolution by an electron in a synchrotron accel¬ 
erator with magnetic field 1 T if the energy of the electron is 50 GeV. 



APPENDIX 



Electric and Magnetic Units 


In this book we use the MKSA (for meter, kilogram, second, and ampere) system 
of units, also called SI (after Systeme Internationale) units. MKSA units are now 
used in most books on electromagnetism. There are, however, other systems of 
units. Of these, the most common is the cgs (for centimeter, gram, second) sys¬ 
tem, also called Gaussian units. Gaussian units are commonly used in books on 
quantum mechanics. It is often desirable to convert equations and quantities be¬ 
tween these systems. The two tables following are intended to help with making 
these conversions. 
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Appendix A Electric and Magnetic Units 

TABLE I Equations in the MKSA and Gaussian systems 


Quantity 

MKSA 

Gaussian 

Fields 



E 

1 q' r 

qr_ 


47reo r 2 

r 2 

dB 

Ho Idi x “r 

1 Idi x "r 

An r 2 

c r 2 


Maxwell’s Equations (in vacuum) 



V E 

P 

47rp 


<=0 

V B 

0 

0 

V x E 

3B 

1 3B 

-- 

--- 


at 

c at 


3E 

An w 1 3E 

V x B 

MoJ + e 0M0-^- 

—J 

c c 3r 

Maxwell’s Equations (in matter) 

V D 

Pf 

4rrpf 

V B 

0 

0 


3B 

1 3B 

V x E 

— 



at 

c 3t 


. 3D 

4rr . 1 3D 

V x H 

Jf + — 

—Jf H-— 


at 

c c at 

Constitutive relations 

D 

e 0 E + P 

E 4- 4rrP 

H 

—B -M 

B -4rrM 


MO 


Linear media 

D 

eE 

eE 

H 

Ib 

-B 


M 

M 

P 

eoXeE 

XeE 

M 

Xm H 

Xm H 

Lorentz force F 

q (E + v x B) 

q (e + - x B) 


V c ) 

Energy density (vacuum) u 

f° £ 2 +_L S 2 

2 2/xo 

(e 2 + b 2 ) 

&7T V / 

Poynting vector (vacuum) S 

—E x B 

—E xB 

MO 

1 2g 2 a 2 

4rr 

lq 2 a 2 

Larmor power P 

47 T 60 3c 3 

3 c 3 
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TABLE II Conversion factors between MKSA and Gaussian units. 

In the conversion table below, the speed of light c is an important factor. In the table it has 
been approximated as c = 3 x 10 8 m/s = 3x 10 10 cm/s; the approximation is indicated 
by writing (3). For greater accuracy, replace (3) by 2.99792. 


Quantity 

MKSA 

Gaussian 

Length 

1 meter (m) 

10 2 centimeters (cm) 

Mass 

1 kilogram (kg) 

10 3 grams (g) 

Force 

1 newton (N) 

10 5 dynes 

Energy, Work 

1 joule (J) 

10 7 ergs 

Power 

1 watt (W) 

10 7 ergs/second 

Charge q 

1 coulomb (C) 

(3) x 10 9 statcoulombs (esu 4 ) 

Charge density p 

1 C/m 3 

(3) x 10 3 statcoulombs/cm 3 

Current I 

1 ampere (A) 

(3) x 10 9 statamperes 

Current density J 

1 A/m 2 

(3) x 10 5 statamperes/cm 2 

Potential V 

1 volt (V) 

1 

— x 10 z statvolts 
(3) 

Electric field E 

1 V/m 

— x 10 -4 statvolts/cm 
(3) 

Polarization P 

1 C/m 2 

C rs 

(3) x 10 statcoulombs/cm 

Displacement D 

1 C/m 2 

(3) x An x 10 5 statcoulombs/cm 2 

Conductivity a 

1 siemens/m (S/m) 

(3) 2 x 10 9 (seconds) -1 

Capacitance C 

1 farad (F) 

(3) 2 x 10 11 cm 

Magnetic field B 

1 tesla (T) 

10 4 gauss 

Magnetic flux <t> 

1 weber (Wb*) 

10 8 maxwells* 

Magnetic field H 

1 A-tum/m 

An x 10 -3 oersted (Oe) 

Magnetization M 

1 A/m 

10 -3 Oe 

Inductance L 

1 henry (H) 

—x 10 -11 stathenrys (s 2 /cm) 
(3) 2 


^esu is an abbreviation f or electrostatic unit. 
*1 Wb=l T m 2 ; 1 maxwell=l gauss cm 2 


Further Reading 

Thorough discussions of conversions among units are in 

J. D. Jackson, Classical Electrodynamics, 3rd ed. (Wiley, New York, 1999); 

R. K. Wangsness, Electromagnetic Fields, 2nd ed. (Wiley, New York, 1986). 

E. M. Purcell, Electricity and Magnetism, 2nd ed. (McGraw-Hill, New York, 1985). This 
classic book uses Gaussian units. 

For further discussion of units and dimensions see R. T. Birge, Am. Phys. Teacher (now 
Am. J. Phys.) 2, 41 (1934); 3, 102 and 171 (1935). 
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The Helmholtz Theorem 


The field equations of electromagnetism are partial differential equations involv¬ 
ing the curl and divergence of E(x, t ) and B(x, t ). For example, the electric field 
produced by a static charge density p (x) obeys 

VxE = 0 and V • E = p/eo- (B. 1) 

Or, the magnetic field produced by a static current density J(x) obeys 

V x B = fioJ and V • B = 0. (B.2) 

These relations hold at all x and each is a precise statement about the variation of 
the field at x. 

The idea of field theory is that local equations determine the fields globally. 
For example, given a charge at xi and a grounded conducting sphere nearby, the 
differential equations (B.l) should determine the field at a point \2 on the other 
side of the sphere. 

But is a vector function really determined by its curl and divergence? The 
answer is yes, provided the sources are finite in extent. Any real system must be 
finite so the foundation of field theory is solid. 

The Helmholtz theorem states that any vector field can be decomposed into an 
irrotational part (i.e., with curl = 0) and a solenoidal part (i.e., with divergence = 
0), and these components are determined by the divergence and curl of the field, 
respectively. If the field tends to 0 at infinity, faster than r~ l as r —> oo, then the 
decomposition is unique. This abstract mathematical theorem is an important part 
of the foundation of field theory. 


Statement of the Theorem 

Let F(x) be a differentiable vector function, for which the divergence V F = d(x) 
and curl V x F = c(x) are specified at all points in space. To be consistent, 
V • c = 0. Assume d(x) and c(x) approach 0 faster than r~ 2 as r —> oo, and F(x) 
approaches 0 faster than r _1 . Then F(x) may be written in the form 

F(x) = -V^ + V x A, (B.3) 

and the components — and V x A are unique. Explicit functions xlr(x) and 
A(x) for the representation (B.3) are 



d(x')d 3 x' 
4 jt |x — x'| ’ 


(B.4) 
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f c(x')d i x' 

A(x) = / iV—v 

J 4tt |x - x'l 

in which the integrals extend over all space. 

Note that F(x) is expressed in (B.3) as the sum of an irrotational function 
(—Vi J/) and a solenoidal function (V x A). The functions i [r and A are called 
potentials. 

Proof 

Because d(x') and c(x') tend to 0 at oo, the integrals in (B.4) and (B.5) are con¬ 
vergent and so the functions defined in those equations are well defined. To prove 
the theorem we must first verify that V • F is the specified divergence d(x). By 
(B.3), V • F = —V 2 i jr. But 1/ (47r |x — x'|) is the Green’s function of —V 2 (cf. 
Section 3.5) so (B.4) is Green’s solution to Poisson’s equation, — V 2 i fr = d(x). 
That means the divergence of F is d (x), as required. 

Also, we must verify that V x F is the specified curl c(x). By (B.3), 

V x F = V x (V x A) = V(V • A) - V 2 A. (B.6) 

Again, A is Green’s solution to Poisson’s equation, — V 2 A = c. Also, V • A = 0 
by the following calculation: 

V-A = 


In making the last step, the first term in the line above is 0 by Gauss’s theorem 
because c —> 0 on the boundary sphere at oo; the second term is 0 because V • c = 
0. Thus (B.6) implies that the curl of F is c(x), as required. ■ 

Is the decomposition (B.3) unique? This question is the same as asking whether 
we could add to F a function F' with zero curl and zero divergence and that also 
satisfies the boundary condition F' -*■ 0 at oo. There exist functions with VxF' = 
0 and V • F' = 0, the simplest being F' = constant; but they do not approach 0 at 
oo. So the boundary condition that F — > 0 as r —> oo makes (B.3) unique. 

Let’s prove the statement that F'(x) must be 0. We have specified that V xF' = 
0, so F' can be written as F' = V/ for some scalar function /(x). Without loss of 
generality we may demand that / —»■ 0 as r —> oo. (If / approaches a constant C, 
then / — C approaches 0 and has the same gradient as /.) The condition V F' = 0 
requires V 2 / = 0. Now consider the integral 

J s f fV 2 fd 3 x, 


c(x')d 3 x' 


f(v— I— 

J \ 47T |x - x' 

[ ( _v'- - -Vc (x!)d 3 x 

J V 4 jr|x-x'|/ 

/ |- v ' 


c(x') 


4tt lx — x' 




4tt lx - x' 


(B.7) 
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where the integration region is all of space. Of course J = 0 because V 2 / = 0. 
But no w rewrite J as 

J = / [v • (/V/) - (V/) 2 ] d*x = -J (V/) 2 d 3 x; (B.8) 

the first term in square brackets has integral 0 by Gauss’s theorem because 
/V/ —> 0 on the boundary surface at oo. Thus / (V/) 2 d 3 x = 0. But this means 
that V/(x) = 0 for all x, i.e., F'(x) = 0 as claimed. 

In summary, F(x) is determined by its curl and divergence, assuming the func¬ 
tions tend to 0 at infinity. The irrotational component (—Vi J/) depends on the 
divergence d(x) and the solenoidal component (V x A) depends on the curl c(x). 

The simplest application of the Helmholtz theorem is to the field E(x) in elec¬ 
trostatics, for which c = 0 and d = p/eo. But the general theorem (B.3) has a 
myriad of other applications, including magnetostatics, hydrodynamics, aerody¬ 
namics, elasticity, acoustics and seismology. It can also be extended to waves, in 
which —Vi J/ describes a longitudinal wave and V x A a transverse wave. 
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Absorption of light, 513 
Action at a distance, 56, 397 
Accelerators 
betatron, 371 
synchrotron, 601 
Advanced potential, 566 
Aether, 445 

d’Alembert’s equation, 561 
Ampere, definition, 223, 267 
Ampere’s law 
beautiful formal proof, 

277-280 

calculations with, 271-276 
differential form, 268, 323 
for free currents, 323 
integral form, 270, 326 
Ampere-Maxwell law 
covariant form, 461 
in materials, 412 
in vacuum, 399 

Antenna (Radiation) resistance, 
573 

Angle(s) 

Brewster’s, 504 
critical, 491 

incidence, reflection, refraction, 
491 

Anomalous dispersion, 514 
Asymptotic potential, 567, 576 
Atmospheric electricity, 248 
Atomic polarizability, 200 
table, 188 

Atomic transitions, 574 
Attenuation 

electromagnetic wave, 507 
length, 492 

length in biological tissues, 

519 


Barkhausen effect, 343 
Betatron, 371, 391 
Big bang, 443 
Biot-Savart law, 262, 267 
Bohr magneton, 308 
Bohr model of atom, 353 
Bound charge 

physical derivation, 193 
surface density, 192 
volume density, 191 
Bound current, 320-323 
volume density, 316 
Boundary conditions 
on current density, 227 
with dielectrics, 208 
for electric fields, 56, 414, 490 
for magnetic fields, 329-331, 
414, 490 

for magnetic potentials, 331 
rectangular wave guide, 542 
Boundary value problems, 134 
cylindrical boundaries, 161 
with dielectrics, 208 
numerical methods, 172 
rectangular boundaries, 138 
spherical boundaries, 150, 155 
Brewster’s angle, 504, 518 
Bus bar, 273 

Capacitance, 99 
in computers, 204 
relation to resistance, 232 
of a wire, 235 
Capacitor(s) 
cylindrical, 117 
design, 203, 204 
electrolytic, 203 
energy in, 205 


in parallel, 127 
parallel plate, 99 
in series, 127 
spherical, 109 

Cauchy-Riemann equations, 

165 

Cavity definitions 
of electric field, 351 
of magnetic field, 325 
Cavity oscillator, 440 
Cavity resonance, 555f 
Cellular telephones, 523, 583 
cgs (Gaussian) units, 607 
Charge density 
decay time, 239 
line, 50 
surface, 50 
volume, 49 

Charge-current density 
four-vector, 461 
Charged plates (two), 98f 
Child-Langmuir law, 244 
Circuit 
LC, 374 

magnetic, 349, 352 
RC, 240 
resonant, 374 
Circular motion, 256 
Classical atom instability, 586 
Classical model 

conductivity, 236, 247 
diamagnetism, 311, 375 
dispersion, 511 
paramagnetism, 314 
Classical theory of light scattering, 
589 

Clausius-Mossotti equation, 198, 
512 


613 



614 


Index 


Coaxial cable, TEM wave, 55If 
fields, 553 
impedance, 554 
surface charge and current, 554 
Completeness theorem 
Fourier series, 144 
Legendre polynomials, 151 
Complex variables, and Laplace’s 
equation, 164 
Complex wave, 428 
Compton scattering, 593 
Conductance, 228 
Conducting cylinder in a uniform 
field, 120 

Conducting cylinders (two), 122 
Conducting pipe, rectangular, 143 
Conducting sphere 

and point charge, 113, 115 
in a uniform field, 111 
Conductivity 

classical (Drude) model, 236, 
247 

complex, 514 
imaginary, 514, 520 
Conductor(s), 92f 
cavity in, 96 
electrostatic pressure, 95 
good, 508 
grounded, 94 
perfect, 524 

surface charge, 94, 108, 114 
Conformal transformation, 172 
Conjugate functions, 163 
Conjugate points, 114 
Conservation of charge, 7, 226, 

398 

Conservation of energy 
four-momentum, 457, 477 
waves, 503 

Constitutive equation(s), 196, 326 
dielectrics, 196 
magnetic materials, 326 
table, 413 

Continuity equation, 222, 226, 398 
energy-momentum tensor, 478 
integral form, 226 
Conversion of units, 609 
Cosmic microwave background, 
443 


Coulomb gauge, 281 
Coulomb’s law, 3, 44 
tests of, table, 98 
Covariance, 457 
manifest, 457 
Cross product, 13,15 
Curl 

of eleclrostatic field, 54 
as vorticity, 21 
Current, 222 
definition, 223 
as flow of charge, 223 
steady state, direct, 222 
surface, 225 

various devices, table, 238 
Current density 
bound, 316, 320-323, 411 
free, 411 
polarization, 411 
surface, 225 
total, 411 
volume, 224 
Current loop 
full field, 291 
vector potential, 294 
Curvilinear coordinates, 30f 
curl, 32 
divergence, 31 
gradient, 31 
Laplacian, 32 
Cycloid, 260, 298 
cycloidal motion, 258 
Cyclotron, 256 

Cylinder, uniformly magnetized, 
318 

Cylindrical capacitor, 117 
Cylindrical coordinates, 33-35 
table, 34 

Cylindrical symmetry, 116 
angular dependence, 120 

Del operator, identities, 24 
Diamagnetism, 307, 310 
classical model, 311, 350, 375 
Dielectric constant, table, 198 
Dielectric strength, table, 198 
Dielectric sphere 
polarizability, 210 
in a uniform field, 209 


Dielectrics, 186, 485 
energy in, 205 
free charges in, 200 
linear, 197 
model, 207 

Diode, semiconductor, 245 
Diode, vacuum-tube, 241, 244 
current, 244 
space charge, 243 
Dipole (electric), 77, 79 
field of, 78 
induced, 186, 187 
line, 50 
pairs, 215 
potential energy, 81 
potential of, 77 
torque on, 81 

uniformly polarized sphere, 195 
Dipole (magnetic), 284 
dynamics, 290 
field of, 286 
force on, 289 
potential energy of, 288 
torque on, 287 
vector potential, 286 
Dirac delta function, 62 
Disk generator, 358, 370, 389 
Dispersion, 507, 513 
anomalous, 514 
classical model, 511 
in plasma, 514 
Dispersion relation, 433 
in a conductor, 507 
in a plasma, 515 
TE waves, 529 
TEM waves, 526 
TM waves, 538 
for water waves, 520 
Displacement current, 268, 399, 
404 

density, 400 

Displacement vector, 195 
divergence of, 196 
Gauss’s law for, 196 
Divergence, 19 
of displacement vector, 196 
of electric field, 54 
four-vector, 479 
of polarization, 191 
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Divergence (Gauss) theorem, 26 
Doppler shift, 467f, 483 
Dot product, 13, 15 
Drift velocity, 224, 236, 246, 260, 
474 

Drude model, 236, 247 
Dual field tensor, 462 
matrix form, 462 
Dual transformation, 576 
Dynamics (relativistic), 457 

Earth 

magnetic field of, 291, 302 
magnetic moment of, 291 
Eddy currents, 364 
Einstein relativity postulates, 445 
Einstein summation convention, 
15, 453 
Electret, 216 
Electric dipole, 77, 79 
field of, 78 
line, 119 

potential energy, 81 
potential of, 77 
torque on, 81 

Electric dipole radiation, 567f 
fields, 570 
power, 571 
Poynting vector, 570 
Electric field, 46f 
of circular loop, 51 
definition, 46 
of dipole, 78 
of finite straight wire, 50 
of hexagonal array of charges, 
48 

inside a current-carrying wire, 
235 

inside cavities, 96, 351 
lines, 79 

of spherical shell, 52 
Electric field energy, 72 
of two point charges, 74 
Electric motor, 260 
Electric potential, 65f 
of charged circular plate, 70 
of dipole, 77 

of finite straight wire, 69, 71 
ground, 94 


of quadrupole, 80 
uniqueness, 96 
Electric potential in two 
dimensions 

conducting half plane, 167 
fringing field of capacitor, 169 
intersecting conducting plates, 
166 

Electric quadrupole, 79 
potential, 80 
Electromagnet, 348 
Electromagnetic energy density, in 
plane wave, 430 
Electromagnetic field, 397 

angular momentum density, 422 
energy in matter, 419 
energy in vacuum, 418 
history, 4,5 

momentum density, 421 
moving line charge, 473 
moving point charge, 468f 
tensor, 459, 484 
transformation, 464 
Electromagnetic spectrum, 426t 
Electromagnetic wave(s) 
attenuation, 507 
between parallel conducing 
plates, 524 
complex form, 525 
in conductors, 505 
in dielectric, 512 
evanescent, 492 
how they work, 433 
intensity, 488 
plane, 487 

radiation pressure, 430 
seen from moving frame, 467 
speed in dielectric, 486 
spherical, 434 
in vacuum, 423, 425 
wave equation, 424 
wave vector, 490 
Electromotive force (EMF), 356 
back, 372 
induced, 363 
motional, 356, 357 
Electron 
charge, 224 
discovery, 2 


Electron spin resonance, 353 
Electrostatics, 44f 
applications, 82 
cylindrical symmetry, 116, 

120 

general problem, 133 
rectangular symmetry, 98 
spherical symmetry, 107, 110 
Elements, Group IV and V, 245 
Energy 

electric field, 72 
flux, 416 

magnetic field, 382 
particle, 456,482 
wave in conductor, 509 
Energy-momentum flux tensor, 
477 

in terms of electromagnetic 
field, 478 

Energy-momentum four-vector, 
456 

Equation of continuity, 222 
Equations of electromagnetism, 
460t 

Equations of motion, covariant 
form, 460 
Equipotentials, 79 
Ether, 445 

Evanescent wave, 492, 550 
Event, 449 

Exchange interaction, 343 

Faraday cage, 545 
Faraday disk generator, 358 
Faraday’s law, 360 

differential equation form, 

362 

Fermi temperature, 237 
Ferromagnetism, 307, 342 
source of, 343 
Feynman disk paradox, 422 
Field theory, 56 
Force 

between parallel wires, 266 
Coulomb, 44 
Lorentz, 254 

Lorentz transformation of, 475, 
484 

Four-momentum, 456 
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Four-vector, 452 
charge-current density, 460 
contravariant, 454 
covariant, 454 
divergence, 479 
energy-momentum, 456 
Minkowski force, 458 
position-time, 452 
potential, 462 
scalar product, 454, 481 
velocity, 455 
Four-velocity, 455 
Fourier analysis, 433 
Fourier integrals, 563 
Fourier series, 139, 162 
completeness theorem, 144 
finding coefficients, 140 
Frame(s) of reference 
inertial, noninertial, 445 
moving, 448 
rest, laboratory, 448 
Free currents 

in coaxial cable, 334 
in infinite slab, 332 
Fresnel’s equations, 498 
for TE polarization, 500 
for TM polarization, 501 f 

Galilean 

invariant, 448 
principle of relativity, 446 
Galvanometer, ballistic, 389 
Gauge 
choice, 407 
Coulomb, 281,408 
invariance, 406 
Lorentz, 409 
transformations, 406 
Gauge symmetry, 226 
Gauss’s law 

covariant form, 461 
for displacement vector, 196 
integral form, 57 
proof, 57, 65 

and uniformly charged line, 61 
and uniformly charged plane, 61 
and uniformly charged sphere, 
59 

Gauss’s (Divergence) theorem, 26 


Gaussian (cgs) units, 607 
Generator, 369 
Gradient, 18 
of inverse distance, 55 
Grazing incidence, 505 
Green’s function 

of d’Alembert’s equation, 564 
of Helmholtz equation, 562 
of Laplacian, 64 
of Poisson’s equation, 562 
Ground potential, 94 
Group velocity, 488, 508, 519 

Half-wave antenna radiation, 579 
fields, 582 
power, 582 
Poynting vector, 582 
resistance, 583 
Hall effect, 296 
Heaviside-Feynman equations, 

603 

Helical motion, 255 
Helmholtz equation, 542 
Helmholtz theorem, 37, 69, 269, 
362, 406 
Appendix B, 611 
Henry, 377 
definition, 372 

Hertzian dipole radiation, 571 
fields (far, near), 572, 577, 578 
power, 573 
Poynting vector, 572 
resistance, 574 
Hysteresis, 346, 387 

energy loss per cycle, 388 

I versus V characteristic of diode, 
241 

Images 

charge and conducting plane, 
102 

charge and planar dielectric, 

211 

charge and sphere, 113, 115 
charge and two planes, 105-107 
charge and two spheres, 116 
dipole and conducting plane, 
104 

line charge and cylinder, 124 


Index of refraction 
definition, 486 
frequency dependence, 514 
table, 486 
Indices 
Greek, 452 
lowering, raising, 454 
Roman, 459 
Inductance 
mutual, 376 
self-, 372 

Induction, 355, 360 
motor, 261 
Inertial frame 

electromagnetic wave in, 468 
uniform electric field, 466 
Insulators, conductivity, 229 
Interference, 536 
Inverse distance, in terms of 

Legendre polynomials, 159 
Ion rockets, 244 
Ionosphere, 516 
Iterative relaxation, 172 

Jacobian, 595 
Jefimenko’s equations, 603 
Joule’s law, 238 
local form, 238 

Kinematics (relativistic), 455 
Klystron, 523, 556 
Kronecker delta, 16, 52 

Langevin-Debye formula, 190 
Langevin’s formula, 190, 353 
Laplace’s equation 

Cartesian coordinates, 136 
cylindrical coordinates, 159 
spherical coordinates, 147 
Larmor formula, 584 
generalization, 599 
Laser, 431 

LC circuits, 374, 391 
Legendre polynomials, 149, 295 
completeness theorem, 151 
generating function, 157 
inverse distance, 159 
orthogonality, 152 
Legendre's equation, 148 



Index 


617 


Length contraction, 451 
Lenz’s law, 363 

Levi-Civita tensor, 13, 286, 459 
Leyden jar, 204 

Lienard-Wiechert potentials, 593 
scalar, 595 
vector, 596 

Lienard’s formula, 601 
Light, 437 
cone, 481 

Doppler shift, 467f 
Light scattering by electrons, 589, 
591 

classical theory, 589 
Compton scattering, 593 
Rayleigh cross section, 591 
Thomson cross section, 592 
Lightning, 248 
Benjamin Franklin and, 204 
Line charge density, 50 
Line dipole, 119 
Lodestone, 252 

Lorentz-FitzGerald contraction, 
451 

Lorentz force, 254 
from relativity, 475 
relativistic, 458 
Lorentz product, 454 
Lorentz transformation, 448 
boost, 481 

coordinates, 465, 480 
electromagnetic fields, 464f 
energy-momentum, 465 
of force, 475,484 
of four-vectors, 454 
inverse matrix, 454 
matrix, 453 
velocity, 465 

Magnetic braking, 359, 364, 366 
Magnetic circuit, 349, 352 
Magnetic compass, 252 
Magnetic dipole, 284 
dynamics, 290 
field of, 286 
force on, 289 
potential energy of, 288 
torque on, 287 
vector potential of, 286 


Magnetic dipole moment 
arbitrary current distribution, 
285 

planar loop, 286 
Magnetic dipole radiation, 575f 
fields, 576 
power, 577 
Magnetic energy, 382 
astrophysical, 394 
cylindrical solenoid, 383-385 
in Earth's field, 394 
in ferromagnet, 386 
Magnetic field 

circular current loop, 265, 291 
cylindrical solenoid, 275, 276, 
348 

of Earth, 291,302 
inside cavities, 324 
long straight wire, 264, 271 
magnetic dipole, 286 
planar current sheet, 272 
slab of current, 273 
two parallel current sheets, 272 
of wave in conductor, 509 
Magnetic levitation, 367 
Magnetic moment, 285, 286 
table, 309 

Magnetic scalar potential, 335 
Magnetic shielding, 339, 354 
Magnetic susceptibility, 326, 327 
calculation for hydrogen, 328 
calculation for NO, 329 
table, 327 
Magnetic tape, 346 
Magnetism from relativity, 474 
Magnetization, 314 
curves, 345 
definition, 307 
measurement of, 343 
Magnetization field, 410 
Magnetohydrodynamic generator, 
389 

Magnetometer (Hall effect), 296 
Magnetron, 303, 523, 556 
Mass spectrometer, 257,298 
Matrix 

dual field tensor, 462 
electromagnetic field tensor, 
459, 477 


Lorentz transformation, 453 
Maxwell’s equations 
covariant form, 460 
in matter, 400t, 410 
in metals, 506 
in vacuum, 398, 400t 
Mean free path, 236 
Meissner-Ochsenfeld effect, 

368 

Metals, conductivity (table), 

229 

Method of images, 102 
Metric tensor, 478 
Microwave oven, 487, 557 
Microwaves, 523f 
Minkowski 
force, 458 

space-time, 452, 477 
tensor, 457 

Michelson-Morley experiment, 
446 

MKSA units, 607 
Momentum 

density, 477, 479 
electromagnetic, 421 
in plane wave, 430 
relativistic, 456 
Motor, electric, 260 
induction, 261, 370 
Moving frame, 448 
Moving line charge(s) 
fields, 473 

interactions between two, 474 
Moving point charge 
fields, 468f 

fields in covariant form, 484 
Multipole expansion, 75f 
of two point charges, 75 
Muon lifetime, 480 
Mutual inductance, 376 
coaxial current loops, 380 
Neumann’s equation, 377 
transformer, 379 
two solenoids, 392 

Near-field scanning optical 
microscopy, 493 
Normal incidence, 494 
Neumann’s equation, 377 
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Numerical methods, 172 
asymmetric conducting pipe, 
176 

square conducting pipe, 175 

Observer, 448, 482 
Ohm, definition, 228 
Ohm's law, 228 
local form, 230 
Optical fibers, 493 

Parallel line charges (two), 118, 
122 

Parallel plate capacitor, 99 
charging, 401-404, 419 
with dielectric, 202, 212, 

213 

energy, 101 

energy for fixed charge, 206 
energy for fixed voltage, 207 
fringing field, 169 
moving, 483 

Paramagnetism, 307, 313 
statistical mechanics, 314, 350, 
353 

Permeability 

ferromagnetic materials, 346 
vacuum, 263 
Permittivity, 197 
complex, 512 
Phase difference, 509 
Phase velocity, 427, 488, 507 
greater than c, 534 
Plane of incidence, 491 
Plane wave(s), 427 
energy density, 430 
general solution, 431 
momentum density, 430 
Poynting vector, 429 
Plasma 

dispersion, 514 
frequency, 515 
laboratory, 520 
in metals, 520 

Point charge radiation, 584f, 

599f 

fields, 585, 593, 599 
power, 586, 600 
Poynting vector, 585, 599 


Point charge with constant 
velocity, 596f 
complete fields, 598 
Poisson’s equation, 55, 68 
in a dielectric, 200 
Polar molecules, 187, 189 
polarizability, 190 
Polarizability 
atomic, table, 188 
of a dielectric sphere, 210 
polar molecules, 190 
Polarization, 432, 440 
definition, 191 
divergence of, 191 
transverse electric (TE), 498 
transverse magnetic (TM), 500 
Polarization field, 410 
Position-time four-vector, 452 
Potential 

advanced, 566 
asymptotic, 567, 576 
four-vector, 462 
Lienard-Wiechert, 593 
retarded, 561 
scalar, 405 
vector, 405 

Potential energy of magnetic 
dipole, 288 

Potential for point charge, in terms 
of Legendre polynomials, 
159 

Power, various devices, table, 238 
Poynting’s theorem, 416 
Poynting vector, 479 
in dielectric, 488 
electric dipole radiation, 570 
half-wave antenna, 582 
Hertzian dipole radiation, 572 
in matter, 419 
point charge radiation, 585 
TE waves, 532 
TEM waves, 526 
TM waves, 539 
in vacuum, 418 
wire, 420 

Propagation constant 
complex, 507 
good conductor, 508 
Proper time, 455, 483 


Quadrupole focusing, 297 
Quantum electrodynamics (QED), 
6 

and classical electromagnetism, 
7 

Quasistatic approximation, 44, 
380, 402, 469 

Radar, 519, 523 
Radiation, 560f 
classical atom, 586 
electric dipole, 567 
fields, 570 

from collision (Example), 588 
magnetic dipole, 575 
point charge, 584 
resistance, 573, 574, 577, 583 
synchrotron, 601 
Radiation pressure, 430 
on a conductor, 519 
Radiation zone, 568 
Radio, 516 
Rainbow, 519, 522 
Rapidity, 481 
Ray, 536 

Rayleigh scattering, 591 
RC circuit, 240, 251 
Rectangular symmetry, 98 
Rectangular wave guide, 540 
boundary conditions, 542, 

547 

cutoff frequency, 544 
dispersion relation, 544, 548 
TE waves, 541, 546 
TM waves, 547, 548 
Reflection 
angle of, 491 
arbitrary angles, 498 
coefficient, 495 
from a plasma, 515 
law of, 491 
phase change, 495 
total internal, 491 
Reflectivity, 495 
conductor, 509 
normal incidence, 494 
zero, 497 
Refraction 
angle of, 491 
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index of, 491 
law of, 491 
Relativity 

addition of velocities, 452 
and magnetism, 474 
dynamics, 457 
kinematics, 455 
Galilean, 446 
postulates, 445 
transformation, 447 
Resistance, 228 
calculations of, 230f 
relation to capacitance, 232 
Resistivity 
semiconductors, 237 
table, 229 

temperature coefficient, 237 
typical ranges, 229 
Resistors 

fabrication of, 233 
in parallel, in series, 247 
network, 248, 249 
Resonance, 375, 513 
Resonant cavity, 555f 
fields, 556 
frequencies, 556 
Resonant circuits, 374 
Rest energy, 456 
Rest frame, 448, 450 
Rest mass, 456 
Retarded potential, 561 
scalar, 564 
vector, 564 

Retarded time, 564, 568 
implicit equation, 585, 594 
RL circuit, 391 
RLC circuit, 395, 396 
Rowland ring, 343, 395 

St. Elmo’s fire, 121 
Scalar potential, 405 
Lienard-Wiechert, 595 
magnetic, 335 
retarded, 564 

Scalar product of four-vectors, 
454 

Sea water conductivity, 229 
Self-excited dynamo, 359 
Earth’s magnetic field, 360 


Self-inductance, 372 
coaxial cable, 386 
cylindrical solenoid, 373 
isolated wire, 386 
toroidal coil, 373 
Semiconductors, 92 
conductivity, 229 
energy gap, 237 
n-type, 245 
p-type, 245 
resistivity, 237 
Separation of variables, 135 
Cartesian coordinates, 136 
cylindrical coordinates, 160 
spherical coordinates, 147 
SI units, 45 
Appendix A, 607 
Skin depth, 508 
Snell’s law, 491 
Solar corona, 606 
Solenoid, magnetic field, 348 
Space charge, 244 
Spacetime diagram, 481 
Sphere 

in a uniform magnetic field, 
336-339 

uniformly magnetized, 317 
uniformly polarized, 194 
Spherical capacitor, 109 
Spherical coordinates, 35-36, 40 
table, 36 

Spherical shell in a magnetic field, 
339-342 

Spherical symmetry, 107 
angular dependence, 110 
Spherical wave, 434 
asymptotic fields, 436 
electric field of, 435 
magnetic field of, 435 
vector potential, 434 
Square conducting pipe, 
numerical solution, 175 
separation of variables, 138 
SQUID, 255 
Standard model, 5 
Standing wave, TE(n) mode, 529 
Static electricity, 82 
discovery, 2 

Stem-Gerlach experiment, 289 


Stokes’s theorem, 27 
Suffix notation, 9, 14f 
double cross product, 15 
four vectors, 452 
Summation convention, 15, 453 
Superconductivity, 368 
Superposition principle, 46, 282, 
425 

Surface charge on wire with 
current, 234 

Surface charge density, 50 
bound charge, 192 
Surface current, 225 
TE waves, 531 
TM waves, 538 
Susceptibility, 
electric, 197 
magnetic, 326 
magnetic, table, 327 
Synchrotron radiation, 601 

Tachometers, 365 
TE polarization, 498 
TE waves, 528, 549 
cutoff frequency, 531 
dispersion relation, 529 
fields, 529 
group velocity, 531 
phase velocity, 531 
Poynting vector, 532 
rectangular wave guide, 541 
as reflections, 534 
surface current, 531 
TEM waves 
in coaxial cable, 55If 
dispersion relation, 526 
fields, 526 
Poynting vector, 526 
Tensor 

contravariant, 454 
covariant, 454 
dual field, 462 
electromagnetic field, 459, 
484 

energy-momentum flux, 477 
Levi-Civita, 15, 459 
metric, 478 
Tesla, definition, 253 
Thomson scattering, 592, 606 



620 


Index 


Time 

dilation, 450 
interval, 450 
proper, 455, 483 
retarded, 564, 568 
Time constant of decay, 239 
TM polarization, 500 
TM waves, 537, 550 
dispersion relation, 538 
fields, 537 

Poynting vector, 539 
rectangular wave guide, 547 
as reflections, 538 
surface charge, 539 
surface current, 538 
Torque 

electric dipole, 80 
magnetic dipole, 287 
Total internal reflection, 491 
Transformation 
dual, 576 

of fields (relativistic), 464,482 
Galilean, 447 
Lorentz, 448 

velocity (relativistic), 452 
Transformer, 379 
Transmission 
coefficient, 495 
line, 524, 551 
Transmissivity, 495 


Uniformly polarized sphere, 
194 

dipole moment, 195 
Uniqueness theorem, 96, 406 
proof, 134 

Units, Appendix A, 607 

Vector calculus, 18f, 38 
curl, 20 
divergence, 19 
fluid mechanics, 29 
gradient, 18 
Laplacian, 20 
Vector potential, 405 
current loop, 294 
integral form, 282 
Lienard-Wiechert, 596 
magnetic dipole, 286 
magnetostatics, 280 
of bound currents, 316 
retarded, 564 

rotating charged sphere, 282 
Vectors 

algebra, lOf, 13f 
circulation, 16 
covariant, 482 
flux integral, 26 
geometry, 16f 
notation (table), 10 
product identities, 14-16 


Velocity 

addition (Newtonian), 451 
addition (relativistic), 452 
electron drift, 236, 246, 260 
group, 488, 508 
phase, 427, 488, 507 
selector, 258 
terminal, 367 
thermal, 237 

Volume charge density, 49 
bound charge, 191 

Waves, electromagnetic, 423f 
TE, 528 
TEM, 526 
TM, 537 

Wave equation, 424 
in conductors, 506 
Wave guide, 524 
arbitrary shape, 549f 
rectangular, 541 
Wave vector, 490 
Weber, definition, 357 
World line, 483 

Zero reflectivity, 497 
Zonal harmonics, 161 



